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HYPERBOLIC MIXED PROBLEMS 
FOR HARMONIC TENSORS 


G. F. D. DUFF 


This paper may be regarded as a sequel to (1), where the initial value or 
Cauchy problem for harmonic tensors on a normal hyperbolic Riemann space 
was treated. The mixed problems to be studied here involve boundary condi- 
tions on a timelike boundary surface in addition to the Cauchy data on a 
spacelike initial manifold. The components of a harmonic tensor satisfy a 
system of wave equations with similar principal part, and we assign two 
initial conditions and one boundary condition for each component. Our results 
will for the most part be deduced from the analogue of the mixed boundary 
value theorem of generalized potential theory, in which some components and 
some normal derivatives of components are specified on the boundary surface. 

To establish this result in the hyperbolic case, we study linear systems of 
hyperbolic equations with similar principal part under corresponding boundary 
conditions of the two types mentioned. For this purpose we rely on (7) where 
the case of assigned boundary values is treated, and on the modifications (5) 
necessary for the normal derivative problem. We then apply the general 
existence theorem so found to a number of the special problems of harmonic 
tensors. 

The analogy between mixed problems for hyperbolic equations and potential 
theory for elliptic equations has led some authors (6; 9) to study surface 
layer potentials in the hyperbolic case. However, this method has not yet 
been successfully used to find existence theorems for hyperbolic equations with 
variable coefficients. In this paper we shall encounter a situation familiar 
from potential theory—the occurrence of orthogonality conditions on the 
data in cases where the solution of the given problems is not unique. Again, 
however, indirect methods of constructing the solution when these conditions 
are satisfied seem to be necessary. This aspect of hyperbolic mixed problems 
arises only for systems of second order equations as the solutions of all such 
problems for a single equation are unique (5). 

The Maxwell equations can be conveniently written in the notation of 
harmonic tensors, and Theorem V below can be interpreted as an existence 
theorem for boundary problems concerning time-variable electromagnetic fields. 


1. Mixed problems for systems of hyperbolic equations. Let a normal 
hyperbolic Riemannian space Vy be defined by the metric 


(1.1) ds? = ay dx' dx*, 
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162 G. F. D. DUFF 


with Lorentz signature (1, N — 1) and containing one positive term. The 
associate tensor a“ exists and is contravariant symmetric. We study systems 
of M normal hyperbolic linear partial differential equations in M dependent 
variables u,(r = 1,..., 1 M): 


gt OMe 4 Ole . 
(1.2) L, (ur) = a" > aaa + bot + D> Crs he = Cy 


s=1 


Here the coefficients a“ and 5b‘ of the partial derivatives are the same in all 
equations of the system so that the interrelationship of the dependent variables 
is effected only by the coefficients c,, (r,s = 1,..., M). Conditions of regu- 
larity for the coefficients will be discussed below. 

The Cauchy initial value problem for (1.2) involves giving values for the u, 
and their first normal derivatives on a surface S: ¢(x‘) = 0 which is spacelike: 


a(¢) = a” =] => 0. 
x Ox 


This type of problem has been treated by many authors (6, 8, 9, 10) and can 
be regarded as solved. The solution is defined in the domain Ds of exclusive 
dependence on S which is bounded by a characteristic surface G passing through 
the rim C of the spacelike surface S. We note that the geometry of the char- 
acteristic surfaces of (1.2) is the same as for a single equation. 

Let T: y(x*) = 0 be a timelike surface (a(y) < 0) which meets S in the rim 
C. We shall consider only that region D on the positive side of S in a chosen 
orientation of the timelike coordinate. The normal m to T has contravariant 
components a“ dy/dx*. We now divide the equations (1.2) into two sets 


For r = 1,...,, we assign on T values of u,: 
(1.3) u, =f, y=zil,...,p,onT7, 
while forr = p+ 1,...,/ N we assign values of the normal derivatives 
Ou ; 
1.4) —- = g,, = a eeu M, on 7. 
( -_ a r=p+l, 0 


Together with Cauchy data on S, these conditions determine our mixed 
problem. 

There are certain compatibility conditions to be satisfied by the datum 
functions f, and g,; these conditions ensure that u, and its derivatives up to a 
certain order k are continuous across the characteristic surface G which is 
the locus of all discontinuities originating on the rim C = S(\T. In practice 
it is convenient to subtract out the Cauchy data by first solving the Cauchy 
problem, and then subtracting the solutions from the u,. The new problem 
so found has zero Cauchy data and the compatibility condition of order k 
is now the vanishing on C of the derivatives 


af a's, 
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where ¢, a timelike variable, can be taken as equal to ¢(x‘). Thus, the condition 
of order zero applies only to f,. Consider first the case where the system (1.2), 
the data, and the surfaces S, T are analytic. The Cauchy problem can be 
solved and subtracted out. The domain R between G and T remains. To define 
the solutions in R we note that their values are now determined on G and these 
assigned values will be zero since the Cauchy data of the new problem are 
zero. 

A new coordinate system in R is now introduced. Let G, be a family of 
characteristic surfaces meeting T in hypercurves C,, with Go = G, Cy = C; 
and suppose that G, is analytic. Since we can only secure a local existence 
theorem in the analytic case, it is enough to suppose that the G, are defined 
in a neighbourhood of S and T. We now take G, as coordinate hypersurface 
of a new variable ¢ = x’*, and also we set x’*-'! = x’ = (x) so that the 
timelike surface T becomes a coordinate hyperplane. The reduction given by 
Hadamard (6, p. 76), applied to the system (1.2), shows that in the new 
coordinate system the differential equations can be written 

2 M 
(1.5) ay = L;(u,) + p> Cry Uy + Cy. 
where we have dropped the primes on coordinates and coefficients. Here the 
new dependent variables u, are equal to 


u, exp[f b¥ dx¥—] 


as in Hadamard’s reduction, and the operator L,(u,) contains no differentia- 
tions with respect to /. 
The auxiliary conditions are now 


(1.6) u, = 0, g=zl,...,M:t=QO, 
and 
(1.7) ur =f, y=1,...,p;x=0, 
with 
Ou, N 
(1.8) — = U, + 9,, y=z=b+1,...,M;x = 0. 
on 


We shall suppose that the compatibility conditions of order up tok > N + 2 
are satisfied, and it follows as in (5) that we can subtract from the u, certain 
functions, analytic in R and C* across G, which satisfy the above boundary 
conditions. This means that in (1.7) and (1.8) we can assume f, = Oand g, = 0, 
while the non-homogerieous terms e, in (1.5) absorb the extra terms appearing 
in the differential equations. 

From (5, Lemma I) we see that the normal m to T is never tangent to the 
characteristic surfaces G, Consequently we may now write the homogeneous 
boundary conditions in the form 











u, = Q; ao TETT 

(1.9) ou N-1 ou 
= 7 Boa + hu,, r=p+i1,...,M. 

k=1 09 


Here we have solved for the derivative with respect to ¢ in the second group. 
These conditions apply for x = 0. 

Let the unknowns u, and all other functions in (1.5) and (1.9) be expanded 
in powers of ¢: e.g. 


u, = > Umt", L(u,) = D Lin(u) f". 
n=0 


n=0 


By a subscript attached to a function we shall mean the coefficient of ¢" 


in such an expansion. We now determine the u,, recursively form = 1,2,.... 
From (1.5) we have 
du, 
(1.10) ne = Lor (t-1) + --- + Qo Creo thant +... +e ly 
s 
where the terms omitted contain the u,;,...,uUx,... up to the u,,~», and 
their derivatives with respect to x',...,: <2 and x = x*-'. The boundary 


conditions (1.9) now read 


Um = 9, y=il,...,P; 
(1.11) nw-1 au 
1 Urn, = Z Bio + ho tent ae r=p+1,...,M. 
k=l 


To satisfy (1.6) we take u,o = 0. Thus the u,, are determined by integration 
with respect to x. The expressions so found contain additions, multiplications 
and integrations, all of which operations preserve the relation of dominance 
between power series in the variables x', . . . , x*—', t. Thus by dominating the 
coefficients in (1.5) and (1.9) we would find a series solution dominating the 
original one. 

By choosing a single function U to dominate all of the u,, we can reduce the 
proof that the dominating series converges to the case considered in (5). 
This can be done by choosing a single dominating coefficient for each of the 
combinations 


>, Cre (s=1,...,M) 


which would appear in (1.5) if all «, were equal, and a common dominant for 
the e,. The dominating series in (5) has a positive radius of convergence in 
each variable, and has positive coefficients, so that (1.6) and the first of (1.9) 
are dominated. This series also satisfies for all x',...,x%-',t a relation 
(5, 2.27) which is equivalent to the domination of the second group of conditions 
(1.9). We may therefore conclude that a local analytic solution of (1.5), 
(1.6), (1.7), (1.8) exists, with a radius of convergence independent of the 
datum functions. Referring back to the original problem, we see that the 
solution here is analytic except possibly on G where it is of class C*. 
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To extend the domain of the solution, or to lighten the hypothesis of 
analyticity, certain estimates of the square integrals of the solutions and their 
derivatives are needed- Except for one point the procedure follows that in 
(5; 7), so we shall not give details. The procedure involves integrations by 
parts carried out over (1.2) after multiplication by a first derivative of u 
in a certain timelike direction. A different choice of this direction is necessary 
for the two groupsr = 1,...,pandr=p+1,...,M. Forr=1,...,p 
with the Dirichlet-type condition as in (7), this direction must cross T inwards 
with increasing time, while forr = p+ 1,..., / VM and the normal derivative 
condition as in (5), the direction must cross T outwards with increasing time. 
The result of these calculations is estimates of the k-norms ||u,||, as defined 
in (7) in terms of the functions e,, the Cauchy data, and the given data in 
(1.3) and (1.4). Applying the lemmas of (7) to secure convergence we are 
able to extend the domain of definition of the solution in the direction of the 
future, and to show the existence of solutions in the non-analytic case. 

Finally, we shall remove the compatibility conditions on the rim C, with 
the exception of the first of these conditions for the group r = 1,..., p. 
This is done, as in (5, §6), by writing u, = u,,+9,, and requiring that 
L,(v,) be of class C* in the entire domain while v, satisfies boundary conditions 
such that the conditions for u,, are compatible of order k. We assume that the 
differential equation and data are of class C*, and construct v, as the first k 
terms of the series expansion in the analytic case. The problem for 1“, is now 
seen to be C* and so is reduced to the preceding case. 

We state the general result as an existence theorem; the following sections 
contain the applications to harmonic tensors. 


EXISTENCE THEOREM. Let the coefficients in (1.2) be of class C* (k > N + 2) 
and let the Cauchy data together with the data in (1.3) and (1.4) be of class 
C*+% Let the data in (1.3) and (1.4) satisfy the compatibility conditions up to 
order q inclusive, gq < k — N. Then there exists a unique solution u, of (1.2), 
(1.3) and (1.4), together with the given Cauchy data, and this solution is of class 
C'-" except across G where it is of class C*. 


The hypotheses stated in the above theorem have been simplified and will 
be applied with g = 1 or 2 to the theory of harmonic tensors. However less 
restrictive hypotheses, and a slightly stronger conclusion, are possible as the 
situation is effectively the same as in (5, §6). 

We shall use the uniqueness of the solution several times below, and so we 
state here the conditions of regularity which a solution u,, satisfying the 
homogeneous equations and auxiliary conditions, must satisfy for this purpose. 
Let u, be of class C?, except possibly across G, where continuity only is assumed. 
Since the homogeneous problem is compatible of every order k, it follows that 
the first and second derivatives of u, across G are also continuous (5, §1). 
Thus u, is C? and the estimates referred to above show that u, = 0. 
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The statements of the theorems below will specify the value of g in each 
case. For brevity, we now assume once for all that k > N + 2 and all solutions 
are of class C*-" except possibly across G. 


2. Application to harmonic tensors. We recall here the main points of 
the necessary notation (1). A differential form 


(2.1) @ = $4...., dx A... A dx’. 


where summation with i; < ... < 7, is understood, is based on a totally skew 
symmetric tensor ¢;,,__., ty of rank p. The differentials dx‘ are combined by 
exterior multiplication which has the anti-commuting property dx‘ A dx’ = 
—dx/ A dx‘. The differential of @ is a form of degree p + 1: 


(2.2) do = (doy.....1,) A dx" A... A dx”. 
and 
ie oe 
(2.3) (dq) 4... tp 4s = eT atte ax? Pj... 5p)» 


where I is the skew symmetrized Kronecker delta of rank p + 1, and the 
brackets indicate j; < ... < j,. The dual *¢ is an (N — p)-form with com- 
ponents defined by contraction with the permutation symbol of order N. 
Then the coderivative 5¢, of degree p — 1, is defined by 


bd = (—1)*? +t ladeg. 
The identities 


(2.4) d.d@ = 0, aah = (—1)*?+? g, 6.6@ = 0, 
hold. 

If dp = 0, ¢ is said to be closed, and if ¢ = dx, derived; and dually 6¢ = 0, 
@ = 5x describe coclosed and coderived forms. 

The Laplacian operator A is given by 


(2.5) A = 6d + dé, 


and if the equation A¢ = 0 of harmonic tensors is written out in component 
form it is of the type of (1.2), with M = (3) equations. The coefficients c, 
then consist of contractions of the Riemann curvature tensor of the metric (1.1). 

On any hypersurface S with equation, say x” = 0, a p-form @ induces a 
p-form t@, which contains those components free of the index m, and is known 
as the tangential part of ¢. The residual or normal part, is denoted by n¢, 
and has the factor dx”. Thus we shall write 


(2.6) = th + nd = th + id A dx", 


which decomposition is invariant only on S. The dual interchanges ¢ and n; 
thus *#¢ = m+, «n = t*. If t@ and n@ are assigned on S, then td@ = d std and 
nb are determined, but nd¢ and té¢ can be assigned independently. 


of 
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From Stokes formula 


(2.7) f ae = J ¢, 
Cc oc 


which is valid for any (p + 1)-chain C with boundary dC, we observe that the 
period integral {.@ of a closed form ¢ depends only on the homology class of 
Z. A closed form with vanishing absolute periods on a manifold with boundary 
is derived (2, Theorem 4). 

To distinguish among the various cases which follow, we shall give to each 
problem the name of the problem in elliptic generalized potential theory with 
the corresponding boundary conditions. The Cauchy data t¢, n@, tid, nde 
on an initial surface S will be assigned in each case. We may allow the initial 
surface S to be a multiply-connected manifold, bounded by a manifold 
C = S/\T. We shall, however, assume that any non-bounding cycle of the 
region enclosed by S and T is homologous to a cycle of S, and that any cycle 
of T is homologous to a cycle of C; and vice versa. In effect, we take the domain 
to be topologically uniform in the time dimension. The data will always be 
assumed to satisfy the conditions of the preceding existence theorem. 

The Dirichlet problem involves values of t¢ and n@ on T; thus we shall 
invoke the existence theorem with » = M so that all equations belong to the 
first group. Then we have 


THEOREM |. There exists a unique harmonic form @ with given Cauchy data 
on S and given values of th and n@ on T. 


Here the compatibility condition of the general existence theorem implies 
that ¢@ and n@ given on T should have values on C determined by the Cauchy 
data t@ and n@ on C. 

In the mixed problem of generalized potential theory the boundary data 
are values of t@ and fé¢, or alternatively and dually (3), of n@ and nd¢. From 
(2.3) we deduce the formula 


(2.8) fid@ = (-1)’*" HS + de Ke, 

Ox 
the component form of which is found by setting i,,; = m and separating the 
term 7 = m from the others. Thus, if m@ and nd@¢ are given, the first normal 
derivative of t@ is determined and so the data in this case amount to values 
of nd and normal derivatives of components of t@. We therefore choose the two 
groups of the system (1.2) accordingly, and conclude with the 


THEOREM II. There ex*s's a unique harmonic form with given Cauchy 
data on S and given valués of n@ and nd@ (or td and tid) on T. 


Here the compatibility condition on C requires that n@ (or /@, as the case 
may be) should have values on C in accordance with the Cauchy data. We 
shall later need solutions of class C' or C? and so will assume the compatibility 
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conditions of order 2 and 3 as necessary. These restrict both m@ and nd@ 
(or t@ and té@) in the neighbourhood of the rim C. 

The uniqueness of this solution in the hyperbolic problem is in sharp con- 
trast to the elliptic case where there exist eigenforms in number determined 
by the p-dimensional connectivity (Betti number) of the domain. 

In both of these theorems we could state results for the non-homogeneous 
differential equations 
(2.9) Ad = p, 


by solving a Cauchy problem for (2.9) in a larger domain and subtracting out 
the solution. 


3. Particular cases of the mixed problem. The result of Theorem I! 
admits of a number of special cases in which the governing differential equation 
Ad = pis replaced by a pair of equations. The boundary conditions also take 
various forms; these have been studied in the elliptic case (4). We begin with 
co-closed harmonic forms which satisfy 6¢ = 0, dd¢ = 0. The corresponding 
non-homogeneous equations will be treated. 

For our first theorem relative to coclosed harmonic forms, we assign initial 
data, indicated by the subscript 0, and values for the tangential part of » 
on T. The differential equations are 


(3.1) bdo = p, 56 = <a, 


where p = p, and ¢o = a,_1, are assigned coderived forms. The initial values 
to be given are such that together with (3.1) Cauchy data for the solution 
can be obtained. Thus we prescribe t¢o, t*¢ 9 and t*d@o, indicating initial 
values with the subscript 0. In order that these data should be compatible with 
(3.1) we must require 
(3.2) d sl*hy = (—1)*% teao, 

d sttdgy = (—1)*% txpo. 


We then set ti¢@) = to» and so obtain complete Cauchy data. 
In the boundary condition 
(3.3) te = 8, on 7, 


where @ is a given p-form, the first two compatibility conditions must be 
satisfied. That is, @ and its first timelike derivative on C shall have values 
given by the Cauchy values there. 


THEOREM III. Let too, tbo and t#dgy be assigned on S subject to (3.2). Let 
the boundary condition (3.3) satisfy the first two compatibility conditions on C. 
Then there exists a unique solution of (3.1) satisfying these conditions, and this 
solution is of class C'. 


To prove this we set up the mixed problem 


(3.4) Ad = p+dae, 
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with boundary conditions 
(3.5) to = 0, tid = to, on 7, 


and the Cauchy data introduced above. The values of t@ are compatible for 
k = 0 and 1, while the values of té¢ are compatible for k = 1 since 5¢9 = o 
in view of the first of (3.2) and the definition f6¢) = too. The general existence 
theorem now asserts that ¢ exists and is of class C’. 

We now define 


(3.6) yy =b¢-—<0 
and note that y is continuous across G and is elsewhere of class C*-*—'. Also 
dy = did — do = p — bdo 


is coderived and so édy = 0. Since by = 6.56 — 50 = 0 it follows that Ay = 0, 
We now show that y has zero Cauchy data. From fé¢» = too, we get Wy = 0 
and likewise 

tayo = t*dho — tagg = (-—]1 i d sl*do — l*eayg = OV, 
frome first of (3.2). Also 


tadyo ta(p — ddd) = tepyo — (—1)* tdeddy 


tepyp — (—1)*% dgteddy = 0, 


from the second of (3.2). Finally té~. = 0 since y is coclosed, and thus the 
Cauchy data for y are zero. Turning to the boundary conditions, we find 
ty = 0 on T from the second of (3.5) while téy = 0. Thus y satisfies homo- 
geneous boundary conditions of the mixed type. From the uniqueness of the 
solution in Theorem II we conclude that y vanishes identically. Together 
with (3.4) and (3.6) this establishes (3.1) and concludes the proof of the 
theorem. 

Another set of boundary conditions applicable to the differential equations 
(3.1) are found from the dual mixed problem: thus 


(3.7) tap = £, tadd = n, on 7 
are specified. The restrictions 

(3.8) drt = drlem = (—1)*t#ig = (—1)% tao 

and 

(3.9) dyn = drtxdd = (—1)%t*idd = (—1)% tap 


are necessary in view of (3.1). The initial values are as in the preceding theorem 
Again, we require that the compatibility conditions of orders k = 0 and 1 for 
€ and order k = 1 for » should hold. Thus the value and first derivative of £ 
and the value of 7 on C must agree with those calculated from the Cauchy 
data. 
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THEOREM IV. Let too, t*@o and t*ddo be assigned on S subject to (3.2). Let 
the data in (3.7) satisfy (3.8) and (3.9). Then there exists a unique solution of 
(3.1) satisfying these conditions, and this solution is of class C'. 


For the proof we again solve (3.4), this time with boundary conditions (3.7) 
and the above Cauchy data. The compatibility conditions of orders k = 0 
and 1 are satisfied and a C' solution ¢ exists, by the dual of Theorem II. 
Again, let ¥ = 6¢ — o, which is continuous, and let us show that W vanishes. 
As before, Ay = 0 and y has zero Cauchy data. On T we have 


tay 


Il 


tx(dd — co) = (—1)* drted — tea 
- (—1)* drt — te¢ = 0 
from (3.8). Also 
tady = tx(did — do) = t*(p — bdo) 
tep — (—1)*% drtedo 
tep — (—1)* dry = 0 


from (3.9). Thus y satisfies the homogeneous dual mixed boundary conditions 
and so must vanish everywhere. This proves the theorem. 

We turn now to the differential equations associated with harmonic fields — 
forms which are both closed and coclosed. The corresponding nonhomogeneous 
equations are 


(3.10) d@=p, 66 =a, 


where p = p,4: must be derived and o = o,_,; coderived. 
For initial values we now assign t¢@») and {f*¢» subject to the necessary 
conditions 


(3.11) d,tdo = tddo => tpo 
and 
(3.12) dt*po = tdepo = (—1)%t¥ido = (—1)*tea0. 


We can now obtain Cauchy data for this problem by adjoining to these values 
tedgoo = t*po and tidy = too. 

The appropriate boundary condition is 

(3.13) to = §, 

where the necessary condition 

(3.14) drt = tdod = Ip 


holds. This restriction on ~ will be examined in more detail below; we remark 
that £ shall also satisfy two compatibility conditions in relation to the above 
Cauchy data. 


THEOREM V. Let too and t*@o be assigned on S subject to (3.11) and (3.12). 
Let & in (3.13) satisfy (3.14) and the first two compatibility conditions. Then 
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there exists a unique solution of (3.10) which satisfies these conditions and this 
solution is of class C'. 

To prove this we establish the mixed problem 
(3.15) Ad = ip + do 
with the Cauchy data given above, and with boundary conditions 
(3.16) to = §, td = to. 
The data being compatible of orders k = 0 and 1, the solution ¢ is C'. Let 
(3.17) ¥=i¢-—0, x =do-—p:; 


both of these forms are continuous across G. 
We will show that both y and x vanish identically. Taking first y, we have 
by = 6.56 — bo = Oand 


dy 


so that édy = 0 and Ay = 0. The Cauchy data of y are now to be found. 
First, tho = tédo — too = 0 from the construction of tédo. Next 


ll 


did — do = ip — ddd, 


tayo = teddo — lean = (—1)*% ds tedho — leon = 0, 
from (3.12). Then té~o = 0 since dy = 0, and finally 
tadyo = t* (dpo — bdo) = (— ] pv ds t* (po — dd») = 0, 


according to the construction of t*d¢@o. Turning to the boundary conditions 
satisfied by y, we have ty = té¢ — to = 0 from the second of (3.16), while 
tt@ = 0. Thus ¥ satisfies homogeneous mixed boundary conditions as well and 
so vanishes as required. This proves the second of (3.10). 

To show that x is zero, we have dx = d. do — dp = 0, while 


5x = ddd — ip = do — did = — dy = 0 
so Ax = 0. The Cauchy data for x are: 
txo = tid — tp = dstd — tp = 0 


by (3.11); 
t* xo = txddo -_ te po = () 


by the way t*d¢» was defined; 
ttxo = 0, t#dxo = 0. 
The boundary conditions for x are 
tx = tdd — tp = drtd — tp = drt — tp = 0 


by (3.14); and #x = 0 since x is coclosed as demonstrated above. Since x 
now satisfies all conditions of the homogeneous mixed problem it vanishes 
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identically and this proves the first of (3.10). Thus ¢ satisfies both differential 
equations and the proof of the theorem is complete. 

The restriction (3.14) on the datum form £ will now be examined in more 
detail in order to find just what components can be freely assigned on T. 
For simplicity we shall suppose the differential equations homogeneous and the 
initial values zero. We assert that in these circumstances (3.14) together with 
the first compatibility condition imply that £ is a derived form on T. Indeed 
if dpt = O the periods of ¢ on p-cycles of T are well defined, and by our general 
topological hypothesis of §2, any cycle of T is homologous in T to a cycle of C. 
On this cycle the period is zero since § vanishes there. By (2, Theorem 4), 
it follows that — is derived on T, say §&, = dp f>-1. 

Let C, be a family of spacelike (N — 2)-dimensional surfaces covering 7, 
with Co = C, and let & be decomposed into tangential and normal parts 
relative to C,. We shall, for convenience, refer to these as the spacelike and 
timelike parts of ~, respectively. We now show that due to (3.14) the spacelike 
part is determined by the timelike part. Indeed 


OE 4y...4 


—...%...20 
ot 


(dré) ti,... >» > 
where the subscript ¢ refers to the timelike parameter of the family C,; and 
where the terms not shown are derivatives of components of the timelike 
part of & Since ~ is given for t = 0 by the compatibility condition, we may 
determine 

Fy... ty Wiper nn ty HI 


by integration over ¢. That is, the spacelike part, having (“>*) components, 
is determined by the timelike part, which has (37?) components. That the 
components of the timelike part can be freely assigned is evident from the 
component form of §, = drf,_1, namely 


OE ty... tp- 
Sets... tpas = — oe or ee os axe 


since the time-derivatives can be chosen at will. 
Maxwell’s equations in empty space have the form 
dF.=0, 6F2, = Ji, 
where, in locally Cartesian coordinates with Lorentz metric, 
F, = Hidy A dz + Hiedz A dx + Hydx A dy + (E,dx + Exedy + E;dz) A dt, 


and 


J, = Jdx + Jy + Jdz + pdt. 


Theorem V applies to determine F, if J; is known and the auxiliary conditions 
are specified as follows. For t = 0, assign ¢F. and t*#F2, that is, values of the 
components of E and H. These are subject to (3.11) and (3.12) which become 
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divH=0, divE =p 


(p denoting charge density), respectively. Thus two components of each 
vector can be freely specified. For the boundary conditions on a surface with 
local equation x = 0, we have 


tF, = HidyAdz + ExdyAdt + ExdeAdt, 


the first term on the right being the spacelike part. Thus the two tangential 
components of E are to be assigned, and determine the field uniquely. Applying 
the theorem to the dual of F:, we should take the tangential components of H/. 
. As another direct application of Theorem II, we consider the equations 


(3.18) bd@ = p, did =o, 
associated with the class of harmonic tensors called biharmonic fields (4). 
Here p = p, is coderived and o = ga, derived. For jnitial values we now assign 


the four Cauchy data fo, t*$o, t*ddo and té@o. However the latter two are 
; subject to the conditions 


(3.19) d gtido = tao; dgteddo = (—1)%tapo. 
The boundary values 
(3.20) to = t, téd = 7, on T, 
shall be required, where 7 satisfies 
(3.21) dyn = te, on T. 


In the proof we shall need a uniqueness theorem for second differentials of ¢ 
and so we assume compatibility conditions for — and of orders k = 0, 1 
and 2 on the rim C. 


THEOREM VI. Let Cauchy data on S be assigned subject to (3.19). Let § and y 
, satisfy (3.21) and the first three compatibility conditions on C. Then there exists 
a unique solution of (3.18) satisfying the initial and boundary conditions, and 

this solution is of class C. 


The theorem is proved by constructing ¢ as a solution of 
(3.22) Ad = p+a, 


with mixed boundary conditions (3.20). In view of the compatibility condi- 
tions assumed for ~ and 7», the solution is of class C? across G. Now let 


vy = ddd — p = o — did; 
this form is continuous, and being both derived and coderived is also harmonic. 
Now the Cauchy data for ¥ vanish since 
tho = tay — drtido = 0, 
tayo = txdddo _ t*po = (— ] Vd stxddy = t*po, 
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by (3.19); while t#dy> and fy are clearly zero. On the boundary 7, we have 
ty =le — d rtio =tle — drn = 0, 


from (3.21), while féy is zero identically. Thus y vanishes and (3.18) hold, 
which concludes the proof. 


4. Orthogonality conditions. The problems studied in this section do 
not have unique solutions and moreover the data must satisfy certain integral 
conditions of the orthogonal type familiar from potential theory or integral 
equations. As above, we consider a spacelike surface S bounded by its inter- 
section C with a timelike surface 7. We denote by = any spacelike surface 
lying in the region of definition of the solutions, such that S, 7, and = bound 
a portion of the region. More precisely, we denote by S; and 7; the parts of 
these surfaces which, together with 2, have this property. 

The Neumann problem of generalized potential theory concerns solutions of 
A@ = 0 with given values of ¢*d¢ and té@ on the boundary. We assume these 
conditions to hold on T and take zero Cauchy data for simplicity. Now any 
harmonic field (dr = 0, 6r = 0) with zero Cauchy data is an eigenform of this 
problem and from Theorem V we see that an infinity of linearly independent 
eigenforms 7 exist. In the elliptic case these furnish the orthogonality condi- 
tions directly. If we write down Green’s theorem for the domain bounded by 
Ss, Ts; and = in the form 


(4.1) (dr, dq) + (67, 56) — (7, Ad) = (r A ado — 56 A #7), 


Jar Tz+ 
where (a, 8) denotes the volume integral of a A +*8, (1), we see that the terms 
on the left vanish, and that the integral over S is zero since ¢ nas zero Cauchy 
data. Thus, if r is a harmonic field, with zero tangential and normal part on 2 
defined in the region bounded by S;, Tz; and 2, the integral over = disappears 
and we have 


(4.2) } a (r A «do — 56 A *#r) = 0. 
z 


These are the orthogonality conditions of the Neumann problem and they 
arise from the eigenforms of the backward problem for harmonic fields with 
“final” surface 2. We remark that in (4.2) ¢*7 and fr are not independent since, 
by Theorem V, values of tr alone determine 7 as a harmonic field. 

The problem to be solved is then to find a harmonic form ¢ with zero Cauchy 
data on S and 


(4.2) tedo = §£, tib = 7 on 7, 
where 
(4.3) 4 (rANE—nA*r) =0 


for every backward harmonic field r vanishing on 2, for every such surface 2. 
Let — and 7 vanish on C. 


Oe] 
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~I 


We begin by solving the mixed problem with té@ = 7; thus let Ag, = 0, 
let ¢; have zero Cauchy data, let 4¢; = 0 on T and t8¢, = n. By Theorem II 
there exists a C' form ¢; satisfying these conditions. Now also 


Jorg A addr — 861 Aor) = fi. (A addr — 9A #7) = 0, 
Tz JT: 


and if we subtract these from (4.3) we get 


(4.4) Sr? A (& — tedg;) = 0. 


The following lemma is now necessary. 


LemMMA. Let ¢ be a form of degree N — p + 1 defined on T, and zero on 
C= S(\T. Let 


(4.5) Ja rata 


for all closed forms r defined on T, and vanishing on Cz = T (\ . Then ¢ is 
closed on T: drt = 0. 


To prove this we let y be a solution of 
idy = 0, ty = 0, 


having zero Cauchy data on &, and defined in our region. According to Theorem 
III, we may choose ty on T arbitrarily (of class C*) except that ty and its 
first timelike derivative must vanish on Cy. Then + = dy is a harmonic field, 
with zero Cauchy data on = and so vanishing on Cy. Also fr is closed on T 
sO we may insert tr = fdy in (4.5). Since S; + T; + 2 bounds a region, we 
have by Stokes formula 


( = t = : t = 7 
) Sscarep ndtv A0) Sset tees 8 * Sa regnlt A det 


Here we have extended the definition of ¢ to a form vanishing on S and defined 
throughout the region. By (4.5) and in view of our hypotheses, the first integral 
on the right vanishes. The second reduces to an integral over 7; and so we 
have 


Set A drt = 0. 


But ty is arbitrary except for its values on Cy; we conclude that drt = 0 as 
required.From (4.4) we now infer 


(4.6) dr(t — t*d¢;) = 0, 


since we can construct harmonic fields 7 satisfying the condition of the lemma, 
and since § — t*d@, vanishes on C. By Theorem IV we construct a second form 
x, with zero Cauchy data on S, such that 
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(4.7) idx = 0, bx = 0, 
and 
(4.8) tex = 0, tadx = & — teddy, on 7. 


Here the necessary conditions (3.8) and (3.9) hold, the latter in view of (4.6). 
The data in (4.8) are also compatible of the order stated in Theorem IV since 
§ — t#d@, vanishes on S. Therefore such a C' form x exists. 

Now let 
(4.9) $= oi t+ x; 


we have A¢ = 0 while ¢ has zero Cauchy data on S. Also 


(4.10) tid = tid, + tix = 1 +0 = 7, 
while 
(4.11) tedh = ted, + t#dx 


= tedg, + & — teddy = E. 


This proves 


THEOREM VII. There exists a harmonic form $ with zero Cauchy data satisfy- 
ing (4.2) subject to the orthogonality conditions (4.3); and this form is C'. 


The above proof also shows that the orthogonality conditions are sufficient 
to determine d7t when 7 is given. Therefore the global conditions (4.3) are 
equivalent to this pointwise condition, which however cannot be stated 
explicitly without introducing as we have done a solution ¢; of the mixed 
problem. 

When the data é and 7 satisfy g compatibility conditions, which we shall 
not need to write out explicitly, then the solution constructed above is of 
class C*. We shall take g > 2 in the following second theorem for the bihar- 
monic field equations. The differential equations shall be (3.18) and the 
Cauchy data are again subject to (3.19). The boundary conditions are (4.2) 
where ~ and 7 each satisfy two compatibility conditions with respect to the 
given equations and initial values. In addition the conditions 


(4.12) drt = (—1)*% t*p, drn = tle 
are clearly necessary. 

THEOREM VIII. Let Cauchy data on S be assigned subject to (3.19). Let 
£ and » satisfy (4.12) and compatibility conditions of orders k = 0 and 1 on C. 


Then there exists a solution of (3.18) satisfying the initial and boundary conditions, 
and this solution is of class C?. 


To prove this we again consider the equation A¢@ = p + o with the given 
Cauchy data and the boundary conditions (4.2). For this nonhomogeneous 
equation the orthogonality conditions take the form 


(4.13) (op + 0,7) + i een (r A edo — 56 A #r) = 0. 





Ce 


el 





—~- — 
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As in (3.18) we must have p coderived and o derived; from (3.19) we see that 
we can write p = 60 and o = dt where 


(4.14) tt = tido, (—1)*% t+ = taddo. 


Making use of Stokes formula we transform the volume integral and obtain 
for the left hand side in (4.13) 


(4.15) | A [todd — te6] — [téd — tf] A er) = 0. 


According to (4.14) the integrand is zero over S;, and from (4.12) it follows 
(as a converse of the lemma) that the integrals over 7; vanish. Thus the 
orthogonality conditions are satisfied and a solution, which will be of class 
C?, exists. We can now set 
vY = b6.do — p = o — did 

and show that y is harmonic with zero Cauchy data. From the second of 
(4.12) we find tY = 0 on T while 6y = 0 implies téy = 0 on T. Thus y¥ vanishes 
identically and this proves the stated result. Though these differential equations 
are very little stronger than the single Laplace or Poisson equation, the 
pointwise conditions on the data can be stated directly and no orthogonality 
condition intervenes. The eigenforms of this problem are again the harmonic 
fields with vanishing Cauchy data. 

As a final example we discuss another problem wherein orthogonality 
conditions appear, and for simplicity we consider the homogeneous equation 


(4.16) Ad = 0 
with zero Cauchy data. Here we specify on T the values of 
(4.17) tp = §,, tedd = ny-p-1- 


The number of components so specified is not in general equal to the number 
of components of ¢, and indeed if p = N (4.17) are empty conditions. How- 
ever, we shall prove that the appropriate conditions of orthogonality are 
necessary and sufficient for the existence of a solution. The eigenforms satisfy 


diy = 0, dy =0, 


with ty = 0 on T and any such y is an eigenform. This is demonstrated by 
showing that an eigenform is necessarily closed, as follows: dy is harmonic, 
with zero Cauchy data, vanishing normal part on 7, and with t#d.dy = 0. 
Hence dy vanishes by the uniqueness in Theorem II. 

The orthogonality conditions arise from the eigenforms p of the dual back- 
ward problem based on 2. These satisfy 


(4.18)  édp =0, dp =0, 
with t#p = 0 on 7, and with zero Cauchy data on 2. The backward form of 


Theorem IV shows that infinitely many linearly independent eigenforms p 
exist for each surface =. Thus the orthogonality conditions 
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(4.19) Jr. A sdp — p An) =0 
take effect. 


THEOREM IX. Let Cauchy data zero be assigned on S. Let the data in (4.17) 
satisfy compatibility conditions of orders k = QO and 1. Then there exists a harmonic 
form satisfying (4.17) tf and only if (4.19) holds. 


We have shown that (4.19) is necessary. To prove that it is sufficient, we 
begin by constructing a harmonic form ¢; with zero Cauchy data and with 
t#dd, = n. By Theorem II such a form exists (with ¢*¢,; = 0) and is of class C'. 
The orthogonality conditions apply to ¢; so that 


‘. (di A dp — p A #d¢;) = 0, 
z 


and on subtraction we get 


(4.20) | (& — tdi) A *dp = 0. 

Tz 
By Theorem IV, or its dual, we can, however, choose t*dp to be an arbitrary 
closed form which vanishes on C;. The lemma now shows that 
(4.21) dr(é — td,) = 0. 
Again making use of the dual of Theorem IV, we see that it is possible to 
construct a form x with 

dix = 0, dx = 0, 
having zero Cauchy data and satisfying the boundary conditions 

tx =§— th, téx = 0. 
The necessary condition dual to (3.8) is satisfied here in view of (4.21), while 
the remaining conditions are clearly fulfilled. Now we take the C' form 
od = oi + x 

and observe that A¢ = 0, while 


t¢=td +&E—thi =E 
and 

tad = txdo, + 0 = 7. 
This completes the proof. 

As remarked above, these boundary conditions are empty if » = N. For 
pb = 0, on the other hand, the orthogonality conditions are highly restrictive 
and determine either one of £ and 7 in terms of the other. This set of boundary 
conditions has the property that its adjoint set, in the sense of the theory of 
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differential operators, is the dual set in the notation of generalized potential 
theory. 


8. 
9. 
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SOME GENERATING FUNCTIONS FOR 
LAGUERRE AND HERMITE POLYNOMIALS 


FRED BRAFMAN 


1. Introduction. The Laguerre polynomials L,“ (x) have the following 
hypergeometric and Rodrigues representation (3, pp. 188, 189): 


a (1 + a)n —nN; 
(1) L,” (x) = se ee FA, un c|, 
(2) L(x) = Dife* x"**, 


The Hermite polynomials H,(x) have the representations (3, p. 193): 


(3) H, (x) = (2x)" 2Fo(—43n, —4n + 3; —; —x”), 
(4) H,(x) = (—1)"e&" Dre™. 


This note will present several generating functions for the L,@(x) and 
H, (x). The result (5) was recently obtained, by an entirely different method, 
by Weisner (7). The other following results are believed to be new: 


_ a-i-ete a 4\-8 —xt a; vix —s 
(5) (l-4) (1 — ¢ + ot) exp(—*) .F] , amass 
a; 


me Fangs ole L(x); 
n=0 





la. 3 2. 7 
(6) [1 + 2ut(x — t)]* exp(2xt — f°) oF 2% 22 F 3: — 


— ;[1 + 20t(x — 2) 


mm .F) — of H, (x); 


n=0 





a~o-a) —s Ss — 
(7) dexp{ =! at) (i — ey") 1 — t = ot) PA, +a;(1 — =| 


_ a; tux ( 
oe toe A, tee} 
n 


= + }, $a, 4 3a +} 2; | (a) {": 
athe, = 14+40;° 1% OF 
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—l-a xt _2vt —;__—2vxt 
®) 10-4 exo; = deol; _ ) FA +a; (1 = 7d 
2ut —;  22vxt \ 
+ exp( = Drie cool 


—}n+ 4; 2 (a); mn. 
-E{,53 b+ te, eben adel 


a ee ae. | 
(9) 4 exp(2xt at vt(x — t))~“* Fl — ;fl—o(x—oFP 


. —a 24, ha +: —vt \ 
+[1 + ot(x — 2)] P|! ee a 
~¥ 7] -* — $n + 3, 3a, aed bgp | Hale 


! 
n=0 n 


(10) exp[2xt — ¢ — vt’) cosh[2vt(x — #)] 


@ _— | . n 
=) Pl soll cd si | ie 


n=0 3; n!' 


(11) exp(2xt — #°)(1 + ut!) iF | 6 tio — 9° 


4s 1+ut 
; Digs : 
-.F] in, —}n + ta; | H(z) 
n=0 3; n 


Equations (6) and (9) give divergent generating functions, but the rest are 
convergent for ¢ sufficiently small. In equations (5), (6), (7), (9), (11), @ is 
arbitrary. It should be noted that putting a = — k (k = 0,1,2,...) in (5) 
and (7) reduces them to a form involving the general Laguerre polynomial 
itself, doing this in (11) reduces the left side to a form involving the Laguerre 
polynomial for the special case a = — 4, and doing this in (6) and (9) reduces 
the left sides to forms involving the Hermite polynomials. It should further be 


noted that the 
Pe . 
mal qn, , + big], 


on the right side of (10) are essentially Tchebycheff polynomials of the first 
kind in the argument (1 — v?)-?. 


2. Proof of (5). From (2) it follows that 


z.—a nt+a —z 
(12) L,@ (x) = f fs ds, 
: Cc 


2Qri (g — x)" 





for x # 0, where C is a simple closed path about z = x, not containing z = 0. 
Multiply by 


€ NM, Ci, C2, 2-5 Cp; ] 
le ) 
\ 3) Pal Se, Gm ss + Bet ws 
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sum over nm, and switch operations to find: 


_ “ —n, Ci, C2, +++ 5 Cp; n (a) 
(14) Biv, x,t) = ¥ uF] te Sooo eth L, (x) 
é x" “ -" 

~~ Oxi J. z- 74%) dz, 
where 


= M, Ciy+++5 Cpr wm) 
(15) Am) = ¥ uk] ~ cies ig {te : 
Equation (15) can be summed directly or by a special case of a formula of 
Chaundy (2, p. 62) to give: 


> y —_ a a a, Ciy sees Cp; =| 
ames AG) = 2 Fl Oy... 548 83~-8-&) 





The question of interchange of operations in (14) must now be considered. 
The preliminary result (18) to be obtained below is general enough to contain 
both convergent and divergent cases. For p > gq, the series in (16) diverges 
and the result is a purely formal one. For p < gq, the series converges uni- 
formly for a neighborhood of ¢t = 0 and the interchange of operations in (14) 
is justified. In the latter case, replace the equivalence symbol ~ in (14) by 
equals signs. 
In the general case then, 


éx* se * a er = —vtz 
Biv, x,t) ~ aa | — —-—- | eg a, — an 
( 2ni Jos—x—ts""" 1 d..., ny 











(17) ~of Se Si __(Ga)n--- (Gn (—002)" 
2mi Voz —x — le 4% (didn... eer 
a ida. ada (“0 tf _ Fag 
©*% &, (din... (ddan! 2eidc (@—x—te)™ 


In the last line, the same comments as to justification of the interchange of 
operations apply as before. The last line may be converted by (12) provided 
t is sufficiently small. This gives: 








—a = (Ci)n - ++ (Cp) ea a+ —2 
B( ,t)~é: _ fp/n \ Ot) aT p” nm —2 
(0, #1) Oe Gn al — 1 
ois Rs x — (c,) wee ghee = E n! La” (z 2] 
(18) GteGe waa ee Lew 
xt Ci)n-- - (C)n (—2¢)* 


—l—a Pace : cham | fm N , 
toh it exp 2) $ en 5... Gels l — 8) 


L, [x/(1 — t)]. 


Result (5) now follows by taking 


(19) p=1,¢g=1,4q=-a,dq,=1+a 


and 
of Cc 


(25 
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and using the result (2, p. 62, quoted in 1, p. 949): 

~s ; —xt 2 (a)n La” (x) t 
9 ae >» oo... 2 
20) a~# rt iz] 2 Fa). 


Another previously known result (6, p. 98, eqn. (5.1.15)) also follows 
from (18) by taking 
(21) P=0,¢=1d=lt+a 


and using the result (6, p. 98) 


La’ (x) t 


Do t — a an 
(22) e oF \(—; 1 + a; —xt) 2. +a), 


3. Proof of (6). The development of (6) follows a pattern similar to that 
of §2. From (4) it follows that 


al Fal ( edz 
) as atau 2 wees 
(23) H,(x) = (-—1)" ae J. is — x)" 


Multiply (23) by 


‘ 's » —M, Ci, ~~ +5 Cys 
(24) a | di, sal | ’ 


and proceed in a manner exactly like that of §2 to find, after two interchanges 
of operations: 


Or as a i ain 7 —%, Cig e205 Cy; | t* H, (x) 
 emAee E Fl Di cted dy; n! 
7 (Ci)n - - + (Cy)n (vt)" mn! J at 
“Le (d:)n..+ (dg)nm! mt ofse "a 
z? = ( . (vt) 2 —22 
~ AY le Gl OY (1 Hs) 


~ exp(2xt — # >> {Gre eh (ty Holst) 


Result (6) follows from (25) by taking 





(26) p=1,c, =a,q = 0, 

and using the formula (1, p. 948): 

" — oen-+ pl toda + 4;__—40" | ~ F @aHalx) 
— oh -ti-wi~s 2 


4. Proof of (7). The developments in §§3 and 4 depended on Chaundy’s 
summation formula, which was used, for example, to convert (15) into (16). 
The whole procedure may be repeated to yield new results if one will instead 


use (1, p. 947): 
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. 2 
a= Oo naF| — al 





(28) Th, ocep te = 6) 
— = —4n + 4, a,..., Gp; 
_— ft" Fl jn, 2 2? ’ ’ P' 
DF oak e © 
Start again with (12). Then it follows that 
cs = — 1 ‘ 
E(v, x,t) = } | jn, $8 + $, G1, -.-; ap; | L,” (x) t" 
n—0 bs, ..~ » Oe; 
(29) ~ Ss J a A(z) dz, 
where 





_< zt \" —4n, —}n + 3, 41,..., a5; | 
(30) A(z) = > (- = -) F] b, 6.” |: 


The interchange of operations gives valid convergent results if p+ 1 <q, 


and a formal divergent result if p + 1 > g. The application of (28) to (30) 
and the substitution of the result in (29) yields: 


(31) E(u, x, t) 


- ’ io is , 1, ay Ap; vet 
ww —- otal ¢ Sai “i —— dz 
Qari cz—-x—at _ aera (e@—x—a2t) 
Interchange operations again to find: 


(32) E(v, x, t) 














—_ #\" 2 xt a.) — ‘ )n ° . (dy)n (vt)™ 
(1 — #) exp -5 ) +" gee mes a (1 — 2)" 
where 

(33) P a “(1 — t)* exp[x/(1 — t)] ent tt" de 





21 aI xt r 
ioe 


For ¢ sufficiently small, xt/(1 — ¢) lies within C and hence 
(34) F = L2,@[x/(1 — 2]. 
So far then, the result is: 


(35) E(u, x, t) 


= py eco —8t) (Bn (ain = - @p)n (0t)™ o x ) 
i — 6) exo; = )e o (bilo... Ode (1 — ft)” Len \T—7)° 


As a special case of (35), take 





(36) p = 2,q = 3,a; = }a,a2 = 3a + 3,6; = 3,52 = 4 + $a,b; = 14+ fa, 


)) 
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and recall that: 


(37) (2c)on = 2” (c)n (C + 4)n- 


Then (35) becomes: 


+ 
5 
4 ae Z -#.) — (A) a» (vt)™ @( 
=( 4) exp( 1-—t La + a)o, (1 = Le 


From the known result (20), it easily follows that 


sr a; —xt ~~ a ; xt 
iil Flite at l+ato Ata! 


:1 

= (a)s, Le” (x) # 
_ ./. L. Sse 
a > (1 + )on 


(39) 


2n 


The application of (39) to (38) yields (7). 


5. Proof of (8). 


As a special case of (35), take 


(40) pb = 0,¢ = 3,b; = $,b2 = 4 + fa, db; = 1 + faz. 


From the known result 


t" Ly * (x) 


t y . . ents = —_ uhes 
(41) e' oF; (—;1 + a; —xt) yi Te or 


it is easy to get: 


(42) e* oF, (—; 1 + a; —xt) + &' oF: (—3 1 + @; xt) 
es = La (x) 
=? ~———=--—— 
22 (l+a)n — 


The application of (42) to the special case represented by (40) gives (8) 


6. Proofs of (9), (10), (11). Start from (23), multiply by 


n 1 
. t | —an, —4n + 3, 4),...;, dp; 2 
(43) mal ; , at 


| pt? 
nN: 
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sum on m, and interchange operations exactly as done previously. The result, 


stated without intermediate steps is: 


> r) -™ —in + },a1,..., Ay; |e) t" 
p+2 ¢ a ' 


(44) n=0 baal mye l n 
2, ee (d1)n- - - (Ap)n ($0t)” 
~ exp(2xt — SOs) --- (Sp, Ho,(x — 1). 
exp(2xt — f) 2. "pb n! 
In (44), take 


(45) pb = 2,¢g = 1, a; = }a,a2, = fa + 3,5, = 3 
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From (27), obtain 
1 1 - 2 
3 — 2a) yr 3080 +4; =A?) 

(46) (1 2xt) Fl bes = 

+(1 + 2xt)~* 1, 3a, ha T 3; _ —4t | 

= (a) an Hon (x) " 
~D? eal A. Renee - 
22 (2n)! 


The combination of (44), (45), and (46) will yield (9). 
As the next special case of (44), take 


(47) ’=0,¢g=1,h = 3. 
From (6, p. 102) 
2 x) ft 
exp(2xt — f°) = : H(z) 
10 m1: 
obtain: 
(48) e~* cosh 2xt = > Han (x) 


n=0 (2n)! 


The combination (44), (47), and (48) will yield (10). 
For the last special case of (44), take 


(49) p=1,¢ =1,a; = a,b; = }. 
Substitute the result (6, p. 102): 
(50) Hx» (y) = (—1)" 2™ n! L,—» (y?). 


So far, (49) and (50) give, for v? = u: 


: 1 1 1 , -\ #” 
™ — 9n, —3n + 3,4; H,,(x) t 
(51 ) » Pl a . 1. u| } 


= exp(2xt — ¢’) >> (a), (—ut’)" 


a0 (3)n 


L, ((x — #)’). 


The application of (20) now yields (11). 


7. More general results. Make the definition 
52 | a,c a» +--+ 29% «| 
02) Ja 9, Bs. +s Bei 


= ripl, 


where k = 1,2,3,.... 


Then by direct summation and use of the relation 
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(54) (Chey = RX (c/k), (CC + 1)/k),... ((C +R — 1)/k), 
it follows that 
e ot! o+h~-1 | 
(55) (1 — t)™ sap ee . oupeee’ => | 


m (a), t fe Oty Olay + + 4 Mp} 


— ee ’ Bi, Bo... » Bes , 
Equation (55) is the obvious generalization of (1, p. 947, eqns. 24 and 25). 
For a = 1, (55) gives a generalization of (28), which is the special case k = 2 
of (55). 

It is thus possible to generalize the procedures of the above sections. The 
results are so general and so complicated, it does not seem that they would 
be of much use. Just one of the many possible is presented here for illustration. 
Let w, be a kth root of unity, other than 1 itself. Then: 


’ k—1 »* te, \ 2 
(56) kd — D-* exp - za), (1 + aie) 
a ar pi 


aI a; Wy bx | 
“Lil +a;(1— 11 —t + wi tr) 


aa+t+il e+k—1. 
Ras - B ' (a 
Sey) es* Ff 12) 
— 1 2 k—-1 l+ea 2+a k+a_ 
_it ope ee Dy ee ae. 
For k = 2, (56) reduces to (7) and for k = 1, (56) reduces to (5). The inter- 


ested reader will be able to find similar generalizations of (6) through (11) 
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SOME POLYNOMIALS OF TOUCHARD CONNECTED 
WITH THE BERNOULLI NUMBERS 


LEONARD CARLITZ 
In a recent paper (4) Touchard has constructed a set of polynomials Q, (z) 
such that 


(1) B’9,(B) = 4°. gy ” 


where after expansion of the left member B” is replaced by B,,, 


Br — rn x 
e™* = >) B,x"/m! = =—, 
m=0 ée-l 





and 

a (—1)"* 1 (n!)* 
2 K, SS oe ' -_y + pa =i 6 
(2) Qn +1 2" (1.3.5... (Qn —1)f 
(Touchard writes Q,(z) in place of 2,(z); we have changed the notation in 
order to avoid a clash with the Legendre function of the second kind.) It is 
proved by Touchard that 


4 


(3) Oeys(2) = (28 + 1) M(2) + Pr — M-a(2). 


Using (3), Wyman and Moser (5) showed that 


_ an 2n\~" * +n — 4 
(4) a,(s) = 2°n! (2") p> icipea )° 


In the usual notation of generalized hypergeometric functions (4) may be 
written 


nil wn — 

2n\~ oe a 
(5) Q(z) = 2"n! (2) . + *). —  - 

- a l,-s-=, -s + .~% 

, 2° 2 2 
Now Bateman (1; 2) has introduced a polynomial 
- 443 ‘ 

(6) F,(z) = .F. waked + ctiea | 
such that 
(7) F,(—2z) = (-—1)"F,(2); 


also in place of (6) there is the alternate expansion 
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ae 2..8 i228 iss. 
alz a se =" | * 
(8) F, (2) = (—1) (*),r, z+ , a @ z+2—n 
a i a: 
Using (7), (8) becomes 
ee Ea 
= 


(9) F,(2e+1) = (—1)" ‘ 7 "Vr 


Comparison of (9) with (5) yields at once 


a" 
(10) Q(z) = (—1)" 2" n'(?*) F,(22 + 1). 


In the next place we recall Bateman’s formula (2) 


(11) r.(4) z cosech z = cosech z.Q,(coth 2z), 


where Q,(z) denotes the Legendre function of the second kind. We have also 
in the notation of Nérlund (3, Ch. 2) 


z cosech z = p D2" /m! (Dansi = 0), 


where (symbolically) 
(12) D, = (2B + 1)". 


Expanding the left member of (11) we get 


@ r 


> 5D'F,(D); 
r=) «7: 
in view of (10) and (12) this is equal to 


(—1)" (2"n!)~ ep - (2B + 1)’ 2,(B). 


re 


Since 
_ 2(n!)*_ 1 (® . S155 3.) ) 
Q() = Gra pi rt ata agtligtsi# (\2| > 1) 
we accordingly get 
(13) > 5 (2B + 1)'9,(B) 


2" (n')* , 


a om atl n 1 n ' 3. » 
= (-—1) (Qn + 1)! Qnyi mh z sech (842,841; 043; tanh's). 
From (13) it is clear that 


(2B + 1)’2,(B) = 0 (O<¢r<n), 
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and therefore 


(14) BQ, (B) = 0 O<r<n). 


As for r = n, we have 
-2 6 
2" (n!) 


(2B + 1)°9,(B) = (—1)" Gy ami 





using (14) this becomes 


: . aie ee (n!)* D, —s 
(15) Ba, (B) = ( "Fn “+ 11.3.5. Qn — 1p = * 


This evidently completes the proof of (1). 
It may be of interest to remark that (1) can be verified rapidly in the following 
way. Using (6) and (10) we see that 


— 2n\_1 —n,n+1, B+1; 
en nt (2") Fl 1,1 | 

es 2n\iiws ays {oa on 
ont (2) 2, ( 1) ‘ s s : 


s! 
(B+1)(B+2)...(B +5) = — 





2, (B) 


ll 
-—~- 
| 
_— 
— 


But (3, p. 149) 


so that 


~ :(n\(n+s\(/B+s n\(n+s\ 1 
2X (-1) (")( s > )- > (-1 *(")( ) ) s+ 1 
= F(—n, n+ 1; 2; 1) = 0. 


Thus Q,(B) = 0. Repeated use of the recurrence (3) now completes the proof 
of (14). Finally (3) yields 








¢ n =a (.})* . “(n = y. 
(2B + 1)°2,(B) = (—1)" Ga — 1) @@ — 1)" = i, . (4-1) 
a (n!)* os 
= (-) GF Dien —1)...34P’ 
which gives (15). 
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ON TOUCHARD POLYNOMIALS 


FRED BRAFMAN 


1. Introduction. Wyman and Moser (5) have recently given an explicit 
form for some polynomials of Touchard (4). Equation (6) of this note will 
give a simpler expression for these polynomials. Subsequently, generating 
functions (9) and (12) will be obtained. 


2. A new expression. The expression given by Wyman and Moser in (5) 
is 


4 Lal ‘ +n-— "ey 
= 2" m! 

(1) Q,(x) = 2 ni(” > an Se J ° 
If (1) is written in generalized hypergeometric form it becomes 


' _ ni (2x + I) | -¥ }(—n + 1), —x, —x; | 
(2) Qn (x) - (3)n 2" iF i -<¢= te -e4 a ae Aa; , 


For the time being, restrict x to the values 


(3) x = M, ma«(0,1,2,..., 


and use the transformation formula from Bailey (1, p. 85, eqn. (3)): 


-f, 8, —W; 
(4) FP) . me = | 


_ (1 + 3n)m (4 + 3%) m F —4n, }(—n + 1), —m, =m] 
ny (4)m (1 + 2) m 411, —m — 4n, —m + 4 — 4n; J" 








' _ n! (2m + 1)» (4)m (1 + 2) i in gn | 
(5) Qa(m") = GO + dat) GF Im ; 


_ (1+m), n! r|~™ —n, —m; 
~ 4) *~* 1,—n—m; 
Thus it has been shown that the Q,(x) of (2) reduces to 
‘ _ (1+ x)s n! | -* —n, —X; | 
(6) Q, (x) = (3)_ 2" 3F2 Snap ae de 


for the special case of x any non-negative integer. But then, for fixed n, (1) 
and (6) (in the form (7) below) give two polynomials of degree m in x which 
coincide at an infinite number of values. Hence (2) and (6) are identical and 
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(6) is a valid representation for general x (except for negative integers, in 
which case (6) is indeterminate). 

By the many transformations possible on a ;F, of unit argument, equation 
(6) can be converted to other forms. 

Equation (6) here is equivalent to the result obtained independently by 
Carlitz in equation (10) of the immediately preceding work (2). This equi- 
valence may be demonstrated by the formula given by Bailey (1, p. 22): 


I’ (ares) TP (ar2s) TP (@res) Fp(0; 45) 
= (—1)" T'(ei2s) Taos) T(aors) Fn (5; 02). 


Carlitz has shown that the Q,(x) are essentially the Bateman polynomials 
F,,(2x + 1). Hence the results to be obtained below may also be interpreted 
as results on the F,,. 


3. Some more generating functions. The form (6) makes available 
generating functions in addition to the one given by Wyman and Moser in 
(5). Convert (6) into: 














-% n! iat (1 + x)a» (—*)x (—1)* 
7) Qn(x) = oy See. 
Then 
‘ = On(x) (4)n (2t)" 
” a n\n! n! 
_ ~ * +29 (—2) (-1)*F 
=, (n—k)! (n —k)! kik! 
SS (1+), (—x), (-1)° 
" Dy > n' n' k! k! 


1F(1 + x; 1; t) + 1Fi(—x; 1; —2). 


The last line may be converted by Kummer’s first formula to get the generating 
function: 


(9) e‘[:Fi(—x;1; -—)) = > Qu() Gs 


Ane 


It should be noted that for x a non-negative integer, the ,F; on the left of 
(9) becomes a Laguerre polynomial, so that (9) is: 





(10) e(L.(—)]" = > Qu(x) (3)n (24)"— 


n! n! n! 
For the next generating function, sum (7) as indicated below to get: 
Qn(x) (4)n (2t)" 
an :;* n! n! 
(1 + *)n-x (—*), (—1)* nl 
<0 00s (@ — kj! (nm — hk) Ri! 





int 





Ny 
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> (1 + x), (—x), (—1)* (n +k)" 








a= ba? nn! k! k! 
_F +2)" (—x) (—1* (1+), 
7 2 n! 2 k! k! 


->°% tet pf! +n, a -+| 


—a tor Ze Othe pf —™ Ft]. 


By a well-known formula due to Chaundy (3, p. 62) the last line above sums 
to give the generating function: 





(12) (1+¢)?(l1-—t)** “* sit #1 =f | 
_ F Alx ) (3). (24)" 
> n! n! 


The »2F; in (12) is essentially a Legendre polynomial if x is a nonnegative 
integer, in which case (12) becomes 


(13) a¢y'a-0-> “p(ite) $e Dale) Bn (21)* 


— n' n! 
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ASYMPTOTIC EXPANSIONS II 
LEO MOSER ann MAX WYMAN 


1. Introduction. In a previous paper (1) the authors considered the 
problem of finding an asymptotic formula for numbers or functions B,,,, 
whose generating function is of the form 


« x" 
(1.1) exp(P,, (x)) - Do Bama 


n=0 


where P,,(x) is a polynomial of degree m in x given by 


(1.2) P,(x) = > a,x", aq, * 0. 
k=l 


The above-mentioned paper contained the restriction that a, > 0. In 
our present paper we remove this restriction and allow the coefficients a, to 
be positive, negative or zero. However we do retain a, ~ 0. In (1) it was 
shown that there is no loss in generality in assuming that the greatest common 
divisor of the values of k, for which a, # 0, is one. This assumption we shall 
also retain throughout the present paper. 

Since the degree m of the polynomial P,,(x) is fixed we shall, wherever 
possible, suppress m from our notation. Iterated exponential functions occur 
throughout the paper. For this reason we shall use alternative notations 
e* or exp(x) to denote the exponential function. In this notation we write 
(1.1) and (1.2) as 


(1.3) exp(P(x)) = 2 Bai 
(1.4) P(x) = > a, x’, a, = 0. 
k=l 


2. Trigonometric polynomials. In our previous paper (1) the trigono- 
metric polynomial S(R, 6) associated with P(x) by means of 


(2.1) S(R, 0) = R{P(Re“)] = > a, R* cos ké 


played a very important role. We shall call this function the dominant function 
of P(x) and throughout the paper R shall be considered as large and positive. 

When the coefficients a, > 0 the dominant function has a greatest maximum 
at @ = 0. Further in a sense, which we shall explain later in the paper, this 
greatest maximum is unique in the range 0 < @ < x. However when we do 
not restrict the sign of the a, the greatest maximum will, in general, occur at 
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some value of @ which depends on R. There may also be other maxima which 
we shall define to be equivalent to the greatest maximum. From the point of 
view of finding an asymptotic formula for the B, these equivalent maxima are 
important. In fact the asymptotic formula is obtained by expansions about 
the angles which give the greatest maximum and its equivalents. 

For the above reasons we shall, in this section, discuss and classify the 
maxima of the dominant function S(R, @). Throughout the section we restrict 6 
to be in the range 0 < 0 < =. 

Denoting differentiation with respect to @ by a prime then the values of 6 
which make S(R, @) a maximum are contained in the solutions of the equation 


(2.2) S'(R, 0) = — > ka, R’sin ko = 0. 
k=l 


For a large positive value of R it is easily shown that all of the solutions of 
(2.2) for @ must have the form 


F 1T — 7 
(2.3) 6,(R) = —+ >> b,,R, 
m s=l 
where r is an integer in the range 0 < r < m and b, , are constants. 
The only values of r for which 6, will give a maximum of S(R, @) are those 
values that will satisfy the equation 


(2.4) a, cos(rr) = |a,| 


ls 


which of course implies that consecutive maxima are separated by an angular 
displacement of approximately 2x/m. 

Definition 2.1 The angles 6, of (2.3) which yield maxima of S(R, 6) shall 
be called the asymptotic angles of S(R, 0). The reason for the name will become 
apparent later in the paper. 

Definition 2.2 Let 6,, 0, be two asymptotic angles of S(R, @). These shall 
be called equivalent asymptotic angles if and only if 


(2.5) lim (S(R, 6;) — S(R, 6;)) = finite constant. 
Row 


Definition 2.3 The maxima associated with equivalent asymptotic angles 
are called equivalent maxima. 


THEOREM 2.1. There is no other asymptotic angle equivalent to 6 = 0 when 
6 = 0 is an asymptotic angle of S(R, 6). A similar theorem holds for @ = rx. 


Proof. Let us assume there is an asymptotic angle @, contained in (2.3) 
which is equivalent to 6 = 0. Two possibilities exist either @, is independent of 
R and hence b, , = 0 or else 6, depends on R and there exists at least one 
b,, # 0. 

For the first possibility 0, = ra/m and 


(2.6) S(R,0) — S(R,0,) = >> a R* (1 — cosk8@,). 
k=l 
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Hence 
lim [S(R, 0) — S(R, @,)] 


Ri 
is finite if and only if 
(2.7) cos k 6, = | 


for all values of k, 1 < k < m, for which a, # 0. Since the greatest common 
divisor of the above-mentioned values of & is one it is easily shown that the 
only solution of equations (2.7) in the range 0 < 6, < x is 6, = 0. 

Let us now assume that the constants b, , are not all zero. This implies that 
there must exist a value of k in (2.2), for which a, + 0, and such that 
sin k(rx/m) ~ 0. Otherwise 6, = rx/m would be a solution of (2.2) and we 
would be back to the case where 6, is independent of R. Let us suppose that 
k = q is the largest value of k such that 


(2.8) sin(grx/m) #0, a, # 0, l<gq<em-—l. 
Under these conditions we can easily show from (2.2) that 
(2.9) 6, = = + O(1/R™™. 
m 
We shall now consider values of k > qg for which a, # 0. For all such values 
of k we have by assumption that 
(2.10) sin(kra/m) = 0. 
Hence by using (2.8), (2.9) and expanding cos(k@,) we have 
(2.11) cos(k@,) = cos(krr/m) + O(1/R*"-**), 
This implies that 
(2.12) a,R* cos k@, = a,R* cos(krar/m) + O(R*+24-2"), 


Since k < m and gq < m — 1, k + 2q — 2m < ¢ — 1. Hence the order term 
cannot affect any terms of order R‘, s > g. A similar result can be obtained 
for the single term k = g. This means that 


(2.13) > a R’cos k0. = >> a, R’cos(krx/m) + O(R*™). 
k=1 


k=q 


From (2.13) we can conclude that 
lim [S(R, 0) — S(R, 6,)] 
Roc 


is finite only if 
(2.14) cos(krar/m) = 1 


for all values of k, g < k < m, such that a, ~ 0. However a, # 0 by assump- 
tion. Hence (2.14) implies cos(grx/m) = 1. This in turn means sin(grx/m) = 0 
which contradicts (2.8). Hence if @ = 0 is an asymptotic angle of S(R, 6) it 
has no equivalent asymptotic angles. A similar proof can be given for 6 = r. 
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THEOREM 2.2. Let 6, and 6, be two non-equivalent asymptotic angles of S(R, @). 
Then 
(2.15) S(R, 0,) — S(R, 0;) = OCR, 
where q is some integer in the range 1 <q < m — 1. 


Proof. Using (2.3) one can easily see that S(R, 6,), S(R, 6;) have expansions 
in the powers of R of the form 


(2.16) S(R, 0.) = OC, R’, 
(2.17) S(R,0,) = >> C, R’, 


where C, and C, are constants such that C,, = C,,. Since the maxima are not 
equivalent there must be a value of s for which C, # C,, 1 < s << m— 1. 
If we denote by g the largest such value of s then 


(2.18) S(R, 0.) — S(R, 0) = O(R®. 


Definition 2.4. Let $; < ¢2 <... < , be the subset of asymptotic angles 
which includes the asymptotic angle for which S(R, @) is equal to its greatest 
maximum and all asymptotic angles which are equivalent to this latter 
asymptotic angle. The asymptotic angles ¢, (i = 1,2,...,¢) are called the 
maximal asymptotic angles of S(R, 6). 


THEOREM 2.3. The set of maximal asymptotic angles consists only of one 
angle if either @ = 0 or 0 = x is a maximal asymptotic angle. 


This follows immediately from Theorem 2.1. 


THEOREM 2.4. Let us assume that the set of maximal angles consists of a single 
angle ,, such that 0 < ¢; < . If ¢ is defined by 


(2.19) « = RO-™/s, 
then 
(2.20) S(R, 0) < S(R, 61 — ©) 


for all values of 6 in the rangeO0 CO< di — eordi te dK. 


The proofs for the two ranges of @ are similar so that we shall consider 
only the range 0 < 6 < ¢; — «. In this range we note that there must always 
exist at least one minimum of S(R, 6). This is easily seen because @ = 0 is 
always a solution of (2.2). Hence S(R, 0) is either a maximum or minimum 
of S(R, 6). If S(R,0) is a minimum then there is nothing more to prove. 
If S(R, 0) is a maximum:then there is a minimum in the range 0 < @ < ¢; 
because S(R, ¢;) is also by assumption a maximum. However all the maxima 
and minima are determined by (2.3), hence the angles at which they occur 
are always a finite distance apart. Since «+ 0 as R-—» © we must have that 
there always exists at least one minimum of S(R, @) in the range 0 < 0 < @i— «. 
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If there is only one minimum of S(R, @) in the range 0 < 6 < ¢; — ¢€ and 
if it occurs at 6 = 0 then the proof of the theorem is trivial. In this case 
S(R, @) is an increasing function in the range 0 < 6 < ¢; — ¢€ and hence for 
this range S(R, 0) < S(R, ¢1 — «). 

Let us now assume that there are other minima of S(R, @) in this range 
and denote by y > 0 the angle closest to ¢; for which S(R, @) is a minimum. 
Since S(R, @) is increasing in the range y < 6 < ¢; — € we must have S(R, @) < 
S(R, $1 — ¢) in this range. Hence to complete the proof we need only consider 
the range 0 < @ < y. Again it is easily shown that in this later range a maxi- 
mum of S(R, @) must exist. We denote by a the angle at which the greatest 
maximum of S(R, @) occurs for the range 0 < @ < y. We note of course that 
a is an asymptotic angle. Clearly for the range 0 < 6 < y, S(R, 0) < S(R, a). 
Hence 


(2.21) S(R, 1 — €) — S(R, 0) > S(R, d1 — ©) — S(R, a). 
Expanding S(R, ¢: — «) in a Taylor’s expansion, and remembering that 
S’(R, 1) = 0 because S(R, ¢:) is a maximum, we must have 


(2.22) S(R, $1 — ©) = S(R, $1) + 4S8"(R, os) @+.... 


Since S(R, ¢;) is a maximum S’’(R, ¢;) is negative and is of order O(R”) 
in R. Hence using (2.19) 


(2.23) S(R, 1 — €) = S(R, 1) — OCR"). 

From (2.21) and (2.23) 
(2.24) S(R, 1 — €) — S(R, 6) > S(R, 6:1) — S(R, a) — OCR"). 
However, $1, a are non-equivalent asymptotic angles, hence, by Theorem 2.2, 
(2.25) S(R, o1) — S(R, a) = OCR". 
where q is an integer in the range 1 < ¢ < m — 1. Thus (2.24) and (2.25) 
imply 
(2.26) S(R, @1 — ©) — S(R, 6) > O(R®. 


By assumption S(R, ¢;) is the greatest maximum of S(R, @) in the range 
0 < @ < x. Hence the order term must be positive. When this is so (2.26) 
gives 
(2.27) S(R, 0) < S(R, o: — ©). 


This completes the proof for the range 0 < @ < ¢: — e. The proof for the 
range @; + «€ < @ < = is similarly obtained. 


THEOREM 2.5. Let us assume that the set of maximal asymptotic angles 
o1 < $2 <... < @, consists of at least two angles. Further let us assume thal 
; and $44; are two consecutive maximal asymptotic angles. If « is given by 
(2.19) then one of the following inequalities must hold: 


us 
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id | (2.28) S(R, 8) < S(R, 6 + ©) 
se or 
or (2.29) S(R, 0) < S(R, du: — ©) 


for all values of @ in the range ¢; + « < 0 < @u1 — «. 


e 
We omit the proof of this theorem because of its similarity to the proof 
< used in Theorem 2.4. 
} “ae 
er THEOREM 2.6. Let us assume that the conditions of Theorem 2.4 are satisfied. 
cI- Further let I,, Iz be defined by 
st 
41¢ 
at (2.30) i,= f expP(Re“) — P(Re*') — ind) do, 
x). z 
} (2.31) I, = f exp[P(Re) — P(Re*') — in6! dé, 
oite 
where n 1s any real number. Then a number k > 0 exists such that 
at 
2.32) \I,| < exp(—RR"*) 
(2.33) \Is] < exp(—RR"*). 
Proof. 
m) | 1—< 
hl < J exp[S(R, 0) — S(R, ¢1)] dé. 
0 
By Theorem (2.4), S(R, 6) < S(R, @: — ©). Hence 
(2.28) \Th| < wexp{S(R, od: — ©) — S(R, $;)}. 
However, using (2.23) we have 
2, (2.29) I,| < r exp(—kR™). k> 0. 
We may absorb the x into the exponent and write 
5) | (2.30) Ih] < exp(—kR™*). k> 0. 
The proof for J. is similar. 
ge « THEOREM 2.7. Let us assume that the conditions of Theorem (2.5) are satisfied. 
6) Under these conditions the absolute value of each of the integrals: 


wo 
w 
— 
> 

Il 


%1—€ 
| exp[P(Re“) — P(Re*) — ind] do, 
0 


Pi+i-—€ 
he w (2.32) J, f exp[P(Re”) — P(Re®’) — ind] d0, i=1,2,...,t—1, 


{ “ere 
(2.33) J,= J exp[P(Re”) — P(Re*) — ind] do, 
les cgi 
val is at most of order exp{ —o(R'/*)} no matter which maximal asymptotic angle ¢, 
by is chosen. 


The proof is essentially the same as that used in Theorem 2.6 
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THEOREM 2.8. Let P(x) be given by (1.4) and as usual S(R, 6) denotes the 


dominant function of P(x), given by (2.1). We shall denote by +r the complex 
number 


(2.34) r = Re* 
and by ® the operator 
d 
e=1r—. 
(2.35) ) oe 
If m is a sufficiently large real number then the equation 
(2.36) @P(r) =n 


has a unique solution for r corresponding to each asymptotic angle of S(R, 6). 
Further the solution r,(m) corresponding to the asymptotic angle @,, given by 
(2.3), satisfies the equations 


(2.37) arg tT, = 0, 
and 
(2.38) Ir,-(m)| = R,(n) ~ (n/mfia,,|)'/. 
Proof. From (1.4) 
(2.39) @ P(r) = > kar = > ka, R’cos(k 0) — i S’(R, 8). 
k=1 k=1 
Hence equating the real and imaginary parts of (2.36) we find 
(2.40) > ka, R’cosk 0 =n 
k=1 
and 
(2.41) S’(R, 0) = 0. 


Equation (2.41) is of course the same as (2.2) hence all of the solutions of 
(2.41) for @ are given by (2.3). Let us choose a particular @, which is also 
an asymptotic angle of S(R, 6). We have already seen that this implies 


a,cosrr = |a,|. If we substitute (2.3) into (2.40) and expand into powers 
of R we see that (2.40) can be written 
(2.42) m\a,| R™ + O(R™') = n. 


Although the first term is independent of r the order term will in general be 
dependent on r. From (2.42) it is easily seen that for large values of n the 
solution of (2.42) for a real positive value of R is unique for each fixed value 
of r. Denoting this solution by R,(m) it is also easily seen 


(2.43) R,(n) ~ (n/mla,,|)'/. 


Since 6, = arg r,(m) and R,(m) = |r,(m)| the theorem is proven. One may 
also show without too much difficulty that if R,(m) and R,(n) are two different 
solutions for |r| corresponding to different asymptotic angles that 


lim [R,(n) — R,(n)] = 0. 


0). 
by 


be 
the 
lue 


lay 
ent 
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In concluding this section we might mention that only the maximal asymp- 
totic angles are necessary in developing our asymptotic formula for B,,. 


3. An example of equivalent maxima. By Theorem 2.3 we know that i! 
@=0 is a maximal asymptotic angle that S(R,@) has no other maximal 
asymptotic angles. This result is, of course, only true when we assume that 
the greatest common divisor of the values of k, for which a, # 0, is one. 
For a long time the authors conjectured that under the above assumption 
any dominant function S(R,@) would have only one maximal asymptotic 
angle. Since the conjecture proved to be false it is of interest to give an example 
in which the dominant function has two such angles. 

Let us take P(x) to be given by 


(3.1) P(x) = x9 4 x99 — 24 4 ld — yl0, 
Clearly the G.C.D. (90, 30, 24, 15, 10) = 1. The dominant of (3.1) is given by 
(3.2) SCR, 6) = R®”® cos(900) + R* cos(300) — R* cos(246) 

+ R'® cos(156) — R'® cos(108). 


(3.3) Clearly the solutions of S’(R, @) = 0 which correspond to maxima of 
S(R, 6) are of the form 


(3.4) 6, = = + O(1/R"), y= 0,1.2,..., 45. 
45 

Hence 

(3.5) S(R, 6,) = S(R, re/45) + O(1/R*®). 


This means that the maximal asymptotic angles can be obtained by choosing 
those values of r which will make S(R, rr/45) as large as possible. In each 
case the coefficient of R® is one. All the other coefficients of powers of R 
depend on r. In order to make S(R, rx/45) as large as possible we start with 
coefficient of R*° and make this coefficient as large as possible. Then, in turn, 
we deal with each of the coefficients of the lower powers. 


(3.6) SCR, rr/45) = R® + R*® cos(2rx/3) — R™ cos(8rr/15) 
+R" cos(rx/3) — R'® cos(2r2r/9 


Step A. cos(2rx/3). The values of r in the set 0, 1, 2, ..., 45 which make 
cos(2rr/3) = 1 arer = 0, 3,6, , 45. 
Step B. —cos(8rx/15). The values of r in the set 0, 3, 6,... , 45 which make 


—cos(8rx/15) as large as possible are 6, 9, 21, 24, 36, 39. 
For these values —cos(8r7/15) = cos(2/5). 
Step C. cos(rr/3). The values of r of the set 6, 9, 21, 24, 36, 39 which make 
cos(r7r/3) = 1 arer = 6, 24, 36. 


Step D. —cos(2rr/9). The values of r of the set 6, 24, 36 which make 
—cos(2rr/9) as large as possible are r = 6, 24. 
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Hence there will be two maximal asymptotic angles corresponding to the 
values r = 6 and r = 24. 

It is interesting to note that if the sign of x'® in P(x) is changed to positive 
then we would again have had a unique maximal asymptotic angle. Namely, 
the one corresponding to r = 36. 


4. Asymptotic formula for unique maximal asymptotic angles. 
When @ = 0 is a maximal asymptotic angle of the dominant function we have 
seen that there are no other maximal asymptotic angles. For this case the 
derivation of an asymptotic formula for B,, as given by (1.3), can be obtained 
by using the same procedure as was used in our previous paper (1). The 
proofs and final formulae are identical. For this reason we state, without 
proof, that the first term of the asymptotic formula is 


! ‘ ' 
(4.1) By ~ PLD) (ae A*P(R)), 
where A is the operator 

d 
(4.2) A= RoR 


and R as a function of n is given by 
(4.3) A P(R) =n. 

Other terms of the asymptotic formula can be obtained from the general 
formula given in (1). 

Further the fact that 


m 


(4.4) A*P(R) = Dra R 
k=1 
allows us to reduce (4.1) to 


“ n! exp(P(R)) 
+5) Be ~ Bx) (aq) 
with R again given by (4.3). 

The case @ = x can be reduced to the case 6 = 0 by replacing @ by x — @. 
This is equivalent to replacing R by —R. For this case the first term of the 
asymptotic formula for B, is given by 


~ (=1)" exp(P(—R)) 


(4.6) B, ae (2% A*P(—R))™ 
and R as a function of n is given by 
(4.7) A P(—R) = 1. 


The formula that corresponds to (4.5) is 


(—1)* exp(P(—R)) 
(4.8) B, , (2xla,,{)' mR"*™ 


and R is given by (4.7). 
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We now proceed to discuss the case when the dominant function has a unique 
maximal asymptotic angle say at 
(4.9) 6=a(R), a(R) #0, a(R) #t. 


Throughout we shall simply write 


(4.10) a(R) = a. 
From (1.3) we have 
Pi 
(4.11) B, = =; (exp(P(x) | 
dx _ 
By Cauchy's Theorem 
n! q (n+1) 
(4.12) B, = 5— Z exp(P(z)) dz, 
27m Cc 


where C is chosen as the circle z = Re**. At this stage the radius of the circle 
R is an arbitrary positive number. Equation (4.12) can be written 


- eo? ae 
(4.13 B,, aR J explz (Re’ ) in6\ d 
n! — ee 
= — {| MR | exv[P(Re") — ind dé. 
aR 0 


If we define the complex number + by 
(4.14) r = Re™ 
then (4.13) can be put into the form 
4 n! - wi 
(4.15) B, = vi exp(P(r)) fexplP (rexp(i(@—a))) — P(r) ina) | 
T v0 
We shall show that the integral of (4.15) has an asymptotic expansion in 


terms of powers of 1/r. However by using Theorem (2.6) we can easily show 
that the integrals 


(4.16) { exp[P(r exp(i(@ — a))) — P(r) — in(6 — a) dé 
and 
(4.17) | exp[P(r exp(1(@ — a))) — P(r) — in(6 — a)] dO 


are both of order exp(—O(|r|'/*)) when «¢ is given by (2.19). Anticipating this 
result we may write 


! wate 
(4.18) Bw sf "exp P(r) | exp{ P(+ exp i(0—2)) — P(r) —in(0—a) }do | 


a 


We define J to be the integral contained in (4.18) and replace 6 — a by @. 
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Hence 

(4.19) fe | exp(F(6)) dd, 
where 

(4.20) F(6) = P(re) — P(r) — ind. 


Remembering that @ is the operator given by 
(4.21) @ = rd/dr 


we have on expanding F(@) about @ = 0 that 


(oy 
ki ° 

We have assumed that there exists a unique maximal asymptotic angle 
6 =a and by Theorem (2.8) there corresponds a unique solution of the 
equations 
(4.23) 6 P(r) =n 
(4.24) arg rT = a. 


(4.22)  F(6) = (@ P(r) — n) i — 0° P(r) 5 + ¥ 0 P(r) 
k=3 


By choosing + to be this unique solution we may write (4.22) as 


io)" 
k! 

Since 0 < arg r < x there is no loss in generality in considering the complex 
plane of + as cut along the negative real axis. For this reason there is no 
ambiguity in finding the square root of r. Similarly we shall find that there is 
no ambiguity in finding the square root of certain polynomials in + which later 
enter into the discussion. 

For simplicity we introduce the following notation: 


(4.25) F() = — 0° P(r) + & ot P(r) 


(4.26) raZ",Z=1'; 
ad = 0(40° P(r)! 
ees = (40° P(r); 
(4.29) C,(Z) _ : s‘a, | a ae. 
s=1 
(4.30) f(Z) = C.(Z) Z"*™ (4C,(Z))*”; 


(4.31) v(Z, >) 


II 
Ss, 
—~ 

N 
> 

lL 

= 
© 


In this notation the substitution (4.27) reduces the integral (4.19) to the 
form 


' eh ; 
(4.32) [= (2) J ext- @ + ¥(Z, ¢)) d¢. 


? 
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We note that although @ is a real variable that ¢ as given by (4.27) is, in 
general, complex. Similarly A is, in general, a complex variable. From Theorem 
(2.8) we know that 


(4.33) lim |r| = lim R = o. 

It is easily shown that |h| = O(|r|!/*) and hence 
(4.34) lim |h; = @, 

Finally one can show without too much difficulty that 
(4.35) lim arg h = 0. 


Our next step is to expand exp(y¥(Z, @)) into a Maclaurin expansion about 
Z = 0, of the form 


igle 7 
the (4.36) exp(¥(Z, ¢)) = Do Ve (¢) 2", V = 1, 
k=0 
where the ¥,(¢) are polynomials in ¢. Quite formally one would have on 
integrating term by term and replacing h by © that 
(4.37 I ( A,» fiers (¢) do Z' 
' ot) @ P(r) - & k tf ° 
The rigorous justification of (4.37) requires only a slight modification of the 
es procedure that was used in our previous paper (1). For this reason we omit 
- this justification. Defining }, to be given by 
no 
deve (4.38) b, = f e* v, (¢) d¢, 
ter es 
we can obtain from (4.18) and (4.37) that 
(4.39) B, ~ r' 2' n! af +00" P(r))exp(P(r)) > br | 
and + is given as the solution of 
(4.40) 6 P(r) = n, 
which corresponds to the unique (by assumption) maximal asymptotic angle of 
the dominant function of P(x). 
Since by) = (x)! the first term of (4.39) is 
(4.41) B, ~ 2)(n!) R[x" (x 6? P(r))— exp(P(r)))]. 
We can, if we so desire, replace (@2(P(r)))! by 
- (4.42) (6? P(r))? ~ mla,{}|r|*™; 
hence 


(4.42) B, ~ n! R[r-* (22-8 exp(P(r))/mla,,|* |r/*"). 
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In concluding this section we would like to point out that formula (4.39) 
does not hold for a = 0 ora = r. The reason being that a maximal asymptotic 
angle internal to the range 0 < 6 < x gives a contribution to the asymptotic 
formula from both sides of the maximal asymptotic angle. However at the 
boundary points of the interval a contribution to the asymptotic formula is 
obtained from one side only. For this reason when a = 0 or x formula (4.39) 
and the subsequent formulas are out by a factor of }. 


5. The general asymptotic formula. In the general case we have seen 
that the dominant function can have more than one maximal asymptotic 
angle. We assume that there is at least two such angles and denote the set of 
maximal asymptotic angles by ¢, (j = 1, 2, ...,¢; ¢ > 2). Under these 
assumptions neither of the angles 0 or + can be a member of the set. 

From Theorem (2.8) we have seen that corresponding to a fixed value of j 
there corresponds a unique solution for r of the equations 


(5.1) 0 P(r) = n, 
(5.2) arg T = @;. 
Let us denote this solution by 
(5.3) r,= Rie™, 
Without difficulty one can show that R,, R, are two different solutions for 
t| such that 
(5.4) lim (R,; — R,) = 0. 


n> 


Now B,, is given by 
(5.5) B, = —. f z "*” exp(P(z)) dz, 
Cc 


where C is any closed contour enclosing the origin. Instead of a circular contour 
we choose C to be given as follows: 

A. Range 0 < 0 < 1 + €:1, where €; = R,“-*™/8, In this range C is the 
circular arc z = R, e®. 

B. Range 


o; + €; qo¢ P 541 + €j+15 ] <j qt-—l], aie RP-™"s, 


For this range C is the circular arc z = R441 e®. 
C. Range 
+e. f0anr, GF = RS, 


Here C is the circular arc z = R, e**. 

D. In order to make a closed contour we join all the circular arcs by radial 
lines at the end points of each arc. 

E. Range —x < 0 <0. The contour C for this range is taken to be the 
mirror image of the contour for 0 < 6 < =. 


for 


ial 


he 
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By using Theorems (2.5) and (2.7) we can show that only the portions of 
the contour $, — «; < 0 < o; + €, j = 1,2,...,¢ and their mirror images 
contribute to the asymptotic formula. Hence 


lol st ('*** exp[P(R, ec“) — 
(5.6) B~e a > expiP (Rs 6") — sath at} 

T = oj—¢j R; 
Each of the maximal asymptotic angles can then be treated individually by 
the method of the previous section. This leads to the asymptotic formula 


Cc) t 


! 
(5.0) B, ~ = xf ) p> by, 77" exp(P(r,)) 17 (40° P(r,)) ] . 


k=O ju 


where © P(r,) means [6 P(r)],.,;. Similarly for 6? P(r,). Further the +, are 
given as the solutions of the equations 


(5.8) 6 P(r) =n 
and 
(5.9) arg rT = oy, 


where ¢, is a maximal asymptotic angle of the dominant function of P(x). 
Finally the },, will have formulae analogous to (4.38). To obtain explicit 
formulae, expansions about each maximal asymptotic angle are involved. 


6. Example. We have chosen as an example to illustrate our method the 
Hermite polynomials H,(t). Szegé (3; p. 194) gives an asymptotic expansion 
of these polynomials and the method of proof divides the expansion formula 
into two cases according as m is even or odd. Our method makes no such 
separation and the two cases are treated as one. 

The generating function of H,(t) is 


(6.1) exp(2tx — x") = >> H,(t) =. 


In the notation of the present paper: 
(6.2) P(x) = 2tx — x?, 
(6.3) S(R, 0) = 2tR cos @ — R* cos 28. 


It is easily seen that the dominant function has a single maximum at 
6 = ¢; where @¢; is given by 


(6.4) ¢; = arc cos (t/2R). 
The value of 7 is determined by 
(6.5) 2tr — 2r? = 0, 


(6.6) arg tT = 91. 











208 LEO MOSER AND MAX WYMAN 


Hence 

(6.7) R = (n/2)', 

(6.8) ¢1 = arc cos (t/(2n)'), 

(6.9) tr = Re™' = 4(t + i(2n — #’)'). 

From (6.8) for all real ¢ and large n 

(6.10) o: = 4x — t (2n)-? — (t3(2n)-*/2/6) 

and 

(6.11) n o, = 4 nme — (4n))t + terms of order n-. 


We shall drop terms of the order n~} in our final formula to get an expression 
for the first term in the asymptotic expansion. At certain stages one must 
retain such terms. For example to obtain (6.11) from (6.10). 

From (6.7), (6.9) and (6.11) 


4 
(6.12) f = (4n)™ ettine—Ge) Heo) 
Similarly 
(6.13) exp(P(r)) = exp[} (¢2 + m) + i((4n)*t.. ], 
(6.14) [6? P(r)]? = (Qn)'+.... 


Hence by (4.41) 


(6.15) H,(t) ~ (2/r)* n! Ri(4n)-™ (exp 4(t2 + n) 
+i((2n)t — ina +... )]}(2n)-4. 


Using Stirling’s formula for n! 
(6.16) H,(t) ~ 2°**” (n/e)™ e*® cos((2n)* t — Anz). 


This term agrees to the proper order with Szegé’s formula. Other terms can 
easily be calculated and it can be seen that our method does not distinguish 
between even and odd n. 

We hope in subsequent papers to apply the method to other problems 
involving asymptotic expansions. 


7. Conclusion. In concluding this paper we would like to point out several 
possibilities for further generalizations. The most obvious generalization 
would be to obtain an asymptotic formula when the coefficients a, of P(x) 
are allowed to be complex functions of a complex parameter ¢. We have 
carried this problem far enough to be reasonably certain that our method will 
generalize to this case with a minimum of modification. A second type of 
generalization would be to allow the a, to be functions of m and obtain formulas 
of the Plancherel-Rotach type. We have found examples of this type of 
problem for which our method will apply but it does not seem likely that the 
method can be used to solve the general problem of this type. A final type of 
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generalization we have considered is the replacement of the polynomial 
P(x) by a function of f(x). We have found that the method can be applied 
to the special case of the Bell numbers (2), in which f(x) = e* — 1, but fairly 
stringent conditions would have to be placed on f(x) in order for our method 
to apply to an arbitrary class of functions. 
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A SIMPLE ALGORITHM FOR FINDING 
MAXIMAL NETWORK FLOWS AND AN APPLICATION 
TO THE HITCHCOCK PROBLEM 


L. R. FORD, JR. ano D. R. FULKERSON 


Introduction. The network-flow problem, originally posed by T. Harris 
of the Rand Corporation, has been discussed from various viewpoints in 
(1; 2; 7; 16). The problem arises naturally in the study of transportation 
networks; it may be stated in the following way. One is given a network of 
directed arcs and nodes with two distinguished nodes, called source and sink, 
respectively.' All other nodes are called intermediate. Each directed arc in 
the network has associated with it a nonnegative integer, its flow capacity. 
Source arcs may be assumed to be directed away from the source, sink arcs 
into the sink. Subject to the conditions that the flow in an arc is in the direction 
of the arc and does not exceed its capacity, and that the total flow into any 
intermediate node is equal to the flow out of it, it is desired to find a maximal 
flow from source to sink in the network, i.e., a flow which maximizes the sum 
of the flow? in source (or sink) arcs. 

Thus, if we let P, be the source, P, the sink, we are required to find 
Xiy (t, 7 = 1,...,m) which maximize 

1" 


(1) dx 


san? 


subject to 
(2) Ly (X13 — Xx) = 0 (¢@=2,...,m—1),0 < x45 < Cty, 
j 


where the c,, are given nonnegative integers. 

This is, of course, a linear programming problem, and hence may be solved 
by Dantzig’s simplex algorithm (3). In fact, the simplex computation for 
a problem of this kind is particularly efficient, since it can be shown that the 
sets of equations one solves in the process are always triangular (1). However, 
for the flow problem, we shall describe what appears to be a considerably 
more efficient algorithm. 

Of some theoretical interest is the fact that the procedure assures one of 
obtaining a strict increase in the total flow at each step (in contrast with the 
simplex method). In addition, the Hitchcock transportation problem (4; 12; 
14) can be solved via the flow algorithm in a way which naturally generalizes 


Received April 16, 1956. 
14 problem in which there are several sources and sinks, with flows permitted from any 
source to any sink, is reducible to a single-source, single-sink problem. 
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the combinatorial method recently proposed by Kuhn (15) for the optimal- 
assignment problem. Informal tests by hand indicate that this way of solving 
the Hitchcock problem is extremely efficient. 


1. The minimal cut theorem. A nonconstructive proof of the minimal 
cut theorem, which asserts equality of maximal flow value and minimal cut 
value,? has been given by the present writers in (7). Subsequently a construct- 
ive proof based on the simplex criterion of linear programming was developed 
(1). The algorithm which we describe in the next section also provides a 
constructive proof of the theorem in case the c,, are integral (or rational),* 
as we have assumed for this paper. Like the simplex algorithm, it not only 
produces a maximal flow but a minimal cut as well. This will be important 
for our application to the Hitchcock problem. 

It should perhaps be pointed out that an undirected problem, by which we 
mean that the directions of flow in intermediate arcs are not specified, so that 
the capacity constraints on these arcs are of the form 


(3) Xig H Kye S Cus (i,j =2,...,.m—1;4< jf), 


presents nothing new, since we may replace each undirected arc by a pair of 
oppositely directed arcs, each with capacity equal to that of the original arc; 
i.e., replace (3) by 


(4) Kay S Cay, Kye KS Cay. 


For, given X’ = (x,,/) satisfying (4) and the conservation equations at 
intermediate nodes, setting 


(5) Xj => max (x,,' —_ Xs, 0) 


yields an equivalent flow X = (x,,) satisfying these equations and (3). 

The minimal cut theorem is true for undirected networks as well as for 
directed networks, or more generally, for mixed networks in which some 
intermediate arcs are directed, others not, the obvious changes in definitions 
of cuts and chains having been made. This follows from the comments above 
and the fact, easily proved, that the minimal cut value is the same for a mixed 
network and its equivalent directed network. The theorem is still valid when 
capacity constraints on nodes are admitted, where a cut now, of course, 
includes nodes as well as arcs. This may be proved by splitting each node into 
two nodes as suggested in Fig. 1, and placing the capacity c of the old node 
on the new directed arc joining the two new nodes, thus obtaining an equiva- 
lent network with capacity constraints on arcs only. For a direct proof of the 
theorem in this general form, see (1). It is also shown there (and follows 


*A cut is a collection of arcs which meets every directed chain from source to sink; the 
value of a cut is the sum of capacities of its members. This definition of a cut corresponds to 
that of disconnecting set given in (7). 
5No restriction of this kind is made in (1, 7). 
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Fig. 1 
from the algorithm of the next section) that for integral c 
maximizing flow exists. 

There are several well-known combinatorial theorems which can be viewed 
as rather direct consequences of the minimal cut theorem and the existence 
of an integral maximizing flow. Among these we mention Menger’s theorem 
on linear graphs (13), Dilworth’s chain decomposition theorem for partially 
ordered sets (5), and Hall’s theorem concerning set representatives (11). 
For proofs, see (1; 8; 9). 


ij, an integral 


2. The algorithm. We start the computation from any convenient 
integral flow whatsoever. The initial flow is used to define a starting matrix 
A = (a,;) by letting a,,; be the capacity of the arc from P, to P, diminished 
by the flow from P; to P,; and increased by the flow from P, to P;. If no 
other flow is readily available, one may start with the zero flow, corresponding 
to A = C, where c,,; is the original capacity from P, to P,. 

For certain values of 7 = 1,..., n we shall define labels v,, u,; recursively 
as follows. 

Let v; = ©, uw, = 0. For those j such that a,, > 0, define v; = a,,, wy = 1. 
In general, from those i which have received labels v,;, un; but which have not 
previously been examined, select an i and scan for all 7 such that a,, > 0 
and v,, uw; have not been defined. For these j, define 


(6) vs = min (v4, Gis), wy = 1. 


Continue this process until v,, u, have been defined, or until no further defini- 
tions may be made and 2,, u, have not been defined. In the latter case the com- 
putation ends. In the former case, proceed to obtain a new a,, matrix as 
follows. 

Replace Gyan DY Gyan — Un ANd Gpy, DY Any, + Mm. In general, replace a,;,; by 
Gy;3 — Um and dy; by Am; +, where each j is the wy of the preceding 7’ in 
the backward replacement. This replacement continues until 4, = 1 has been 
completed. The labels v,, u,; are then recomputed on the basis of the new A 
matrix and the process is repeated. 

Notice that », is a positive integer, and hence the process terminates. 
Upon termination, the maximal flow X is given by defining 


(7) Xiy = max(Cy; — A4;, 0), 


as we now prove. 
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LEMMA 1. X is a flow. 


Proof. Clearly 0 < xi; < ¢i;, since a,; > 0. It remains to show that X 
satisfies 50, (xi; — x;,) = 0 (i = 2,...,”— 1). Now the process ensures 
that cy, + Cy, = Qi, + @;,. It follows from this and the definition of X that 


p> (X43 — X44) = p> (Coy — Gy). 


It therefore suffices to prove that >>, a,; (i = 2,..., — 1) is invariant 
under the computation, as certainly >>, (¢,; — a;;) = 0 for the starting 
matrix A obtained from a flow X. But if i (#1, m) is the un; of some /, then there 


isa k = p,. Thus, for this 7, the new a,, are either equal to the old a,, or are 
given by 


Gi; —% forj = l, 
ay = 1413 + Op for j7 = k, 
aij otherwise; 


hence > 5 a: = Dd 5 a4. 
LEMMA 2. X is a maximal flow. 


Proof. At the point where termination occurs, we have defined a set S 
of nodes, consisting of those nodes P, for which v,, uw; have been defined; 
and further, P,; € S, P, ¢ S. Consider the set T of directed arcs P,P, such‘ 
that P, € S, P, ¢ S. Clearly a,,; = 0 for such pairs i, 7, as otherwise we would 
have defined v,, u,;. 

We will show that I is a cut whose value is equal to >>, x1,;, thus proving 
that X is a maximal flow and T a minimal cut. 

That T is a cut is clear. For if there were a chain 


P,P,,...PaPs, with the arcs PiP,, ¢1T,...,PaP, ¢ IT, 
then we could deduce successively (since P; € S) that 
P, €S,...,% €5, 
which is a contradiction. To see that I is equal to the flow value, notice that 
0 (l<i<n), 


|Da, (t = 1). 


Now sum these equations over those i for which P,; € S. On the left side» 
if P, and P, are both in S, then c,, — a,, and cy, — a,, are both in the summa- 
tion and are negatives of each other. All that remain are terms of the form 
Ciy — Qyy, where P,; € S, P, ¢ S. For these, as we pointed out above, a,, = 0. 
Thus the sum on the left reduces precisely to the sum of capacities of members 
of I’, as was to be shown. 


ps (C1y — Ges) = 
7 


‘The set T is actually the set of ‘‘left arcs” defined in (7) 
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3. The Hitchcock problem. The Hitchcock transportation problem 
arises frequently in applications of linear programming. We shall propose 
another computation for the problem which will amount to solving a sequence 
of flow problems of a particularly simple kind. The basic idea, which stems 
from a proof given by Egervary (6) for a theorem of Kénig (13, p. 232) on 
linear graphs, has been used by Kuhn (15) to develop a very efficient combina- 
torial algorithm for the optimal assignment problem, a special case of the 
Hitchcock problem. Our method differs only in details from the Kuhn algo- 
rithm in this case. 

We take the Hitchcock problem in the following form: Given a matrix 


D = (d,,;) of nonnegative integers, and two sets of nonnegative integers 
(@1,~ <> Q@m), (01, .~ 5-Og), with 

m na 

24 = Z by 

i=] j=l 


it is desired to find a matrix X = (x,,) satisfying the constraints 


(8) Xiy > O, z=. Xiy = Qi, > Xiy = b; 
P| ri 
which minimizes the linear form 
(9) > diy X43. 
J 


A physical interpretation of the problem is that there are m originating points 
for a commodity, the ith point having a; units, and m destinations, the jth 
one requiring 5, units. If d,, is the cost, per unit of commodity, of shipping 
from origin i to destination j, find a shipping programme of minimal cost. 

The dual (see (10) for linear programming duality theorems) of the Hitch- 
cock problem is: Find a;, 8, satisfying the constraints 


(10) a, +B; < di; fe ae ae 2 ae n) 
which maximize 
(11) DY ae, + D> b,8;. 

i j 


The proof that the algorithm to be described yields a solution to the Hitchcock 
problem will be based on the fact that the dual form (11) increases by at least 
one unit with the solution of each successive flow problem. Since (11) is 
bounded above by (9), this suffices. 

Each of the flow problems will be of the following form. Find X = (x,, 
satisfying 


Xiy > O, Do Xs < ay, D x < 5; 
5 | i 


1 ‘ Laer 
(12) x43 = O for a given set Q of pairs 4, j, 


which maximizes 


(13) >, X40. 
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This is actually a special Hitchcock problem. To see that it is also a flow 
problem, set up the directed network of m + n + 2 nodes shown in Fig. 2, 
where the capacity on the directed arc P,Q, is zero if i, 7 € Q, large otherwise, 
and the capacities of source and sink arcs are the a, and b,, as shown, and 
interpret x,, as the flow from P, to Q,. 








Fig. 2 


To solve such a problem we may of course use the computation of §2 
involving, in this case, an m + n + 2 by m+ n + 2 matrix. It is possible 
(and computationally convenient) to describe the process in terms of an m 
by m array. The verification that the two descriptions are the same for this 
particular class of problems will be left to the reader. 

Let X be an integral solution of the constraints (12). Corresponding to 
certain of the rows i = 1,...,m of X we will define integers 2,, u,; similarly, 
for certain of the columns j = 1,..., we will define integers w,, \,. These 
definitions will be made recursively; first a set of v,, u, will be defined, from 
these a set of w,, \, will be defined, and so forth, alternating between the rows 
and columns. 

For those i such that }>,x,, < a,;, define 


(14) 0, = ai- Dd xi ws = 0. 
| 


Next select an i which has been labelled and scan for all (unlabelled) 7 such 
that 7, 7 ¢ Q; for these 7, define 

(15) W, = Vy, A; = 1. 

Repeat until the previously labelled i’s are exhausted. We then select a 
labelled 7 and scan for unlabelled i such that x,, > 0; for these i, define 

(16) Vv, = min(xX4;, Wy), Be = J. 


Repeat until the previously labelled j’s are exhausted. Again select one of the 
's just labelled, look for unlabelled 7 such that i, 7 ¢ Q, and define w,, A, by 
(15). Continue in this fashion, using (15) and (16) alternately until either 
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w,, 4, have been defined for some j with >>; x,,; < 5,, or until no further 
definitions may be made. In the latter case X is maximal by Lemma 2; in the 
former we can get an improvement as follows. Let 

























































































(17) v = min(w,, b, — > X44). 
Alternately add and subtract v from the sequence 
(18) rj 9 Dj fr» Str grs Ltr ges =~ + » Sty-rgar Xtages 
where 
ji = Min i= Any Je = Kin» = Ajo tee Je = Big» h = Ayes 
and yp, = 0. 
i .  M, 
! 2 3 4 5 6 7 8 i i 
1 G) G) ! 3 
2 @ 
3 G) ! 2 
41 @ ® ' 
is ° ® Ol1O 
5 C) : @) i | 0 
71©O 9 
- e) G) ©) i | 4 
4 C) 2] 0 
10 © | @ 1 | 4 
a 
bl i | it | i | eo} 9 | 2] 2 
“ilo ! 1 1 1 ! 
sl 7,6)6) 5 4 | 10 9 
Fig. 3 


Example. In Fig. 3, cells containing a circle comprise 2 (the complement of 
2). The entries within the circles (zeros elsewhere) constitute an X satisfying 
the constraints (12). The defining process terminates with ws = 1, As = 4, 
since column 5 is a column in which the sum of the entries x;,5 is less than )5; 
i.e., 3 + 3 < 9. The sequence along which an improvement of 


min (ws, bs — >, x) = 1 


can be made is x45 = 3, x4, = 1, x71 = 0, X79 = 1, x99 = O, as is easily read 
off using the \’s and y's alternately. 

We are now in a position to describe a general routine for the Hitchcock 
problem. As a starting point, form the difference matrix 


1er 


he 





2ad 


ack 
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tw 
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(19 diy — a; — B;, 
where 


ay min dij, B; = min (d;; = Gz) 
4 t 


Thus d;; — a; — By > 0; i.e., ay, B, satisfy the dual constraints (10). The 
next step is to solve the flow problem with 


(20) Q = {ijld,, — a, — B, > 0}. 


If the maximizing flow X satisfies 
Leu = Las 
t.J t 


then X is a minimizing solution to the original Hitchcock problem. 

This is easily deduced as follows. Observe first of all that for any a,, 8,, 
the two Hitchcock problems with cost matrices d,;, and d,, — a; — 8, are 
equivalent; for }>,x,, = ay, 5; x,, = 6, imply that 


> (di; — ay — By) X43 = Dd digs _ D a: a; = Dd 4; B;, 
i,j 1.7 { J 


and the last two sums on the right are independent of x,,. Thus if we have 
a;, 8, with d;, — a; — 8, > 0, and are able to find an X satisfying (8) and 
v4, = 0 for ij7 € Q, then clearly X minimizes > ;,; (di, — a; — 8,)x;), hence 
minimizes >>, , di; Xj). 

If, on the other hand, }>;,;x:, < 4; a, let J be the index set of the labelled 
rows of X, J the index set of the labelled columns, and define new dual 
variables by 


, ja, +k (7 € I), 
az = 4 
(21) la, (1 ¢ I), 
ities j8,—k GE J), 
18, i¢ J), 


where k = min(d,, — a, — B;),i€ 1,7 ¢J. Notice that k > 0, since pairs 
i,j with 7 € J, 7 ¢ J are contained in Q. 


LEMMA 3. Do;a,as + 0 ,b,8B/ > Yaya, + Xb, B,. 


Proof. The fact that X is a maximal flow implies that }>;x,, = 6, for 
j € J. Also, the labelling process ensures that x,, = 0 for i¢ J, 7 € J, and, 
as we have mentioned earlier, x,, = 0 for i€ I, 7 ¢J. Since all i with 
> «1, < a, are in TJ, it follows that 


‘> a;> = > xy = db, 


ied tel jet 


and hence the new dual form has been increased by the amount 


k(>o a,— > d,) > 0. 


ie] jel 
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Another way to see this is to note that the minimal cut in the associated 
network (Fig. 2) has value }-a,; + 5b, < Soa; and hence 
tel det ‘ 


pm a;> > b;. 
ter jet 
Now form the new difference matrix d,;; — a; — 8; > 0 by subtracting e 
from the J-rows of the previous difference matrix, and adding k to the 
J-columns. We may now take the maximal flow X of the previous flow problem 
as a starting point in the new flow problem and proceed as before. 
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ON A DIOPHANTINE PROBLEM 
J. B. ROBERTS 


Introduction. If a;, a2,..., a, are relatively prime positive integers then 
the equation 
(1) Q:%,+...+4x%, =n 
always has solutions in non-negative x, for n sufficiently large. Sylvester 
called the number of non-negative solutions of (1) the denumerant of the 
equation. We shall denote the denumerant by n(a,,..., a;). 

We define N,(a:,...,a@,) to be the smallest positive integer such that 
n(a,,...,@,) >j for all m > N;(ay,... , a). 

Using the known result 

n(1, 2,3) = [(m? + 6m + 15)/12] 


we can readily show that 


N,(1, 2, 3) = [/(12j — 6) — 2]. 


In general the computation of N,(a,,..., a,) is a difficult job. Our interest 
here is in N,(a,,...., a,). The present author has given (4) the value of 
N,(ai,..., a,) when the a; are in arithmetic progression. Brauer and Seel- 
binder (2; 3) have given various upper bounds for Ni(a;,..., a,) in the 


general case. In this paper we concern ourselves with an upper bound for 
Ni(m,m + a,m + 6) where (a,b) = 1. Our upper bound is in many cases 
best possible. We also give a theorem concerning an upper bound for 


Ni(m,m + ay,..., m + a;) 


where a, d2,..., @ are relatively prime positive integers. 


1. Three lemmas. Throughout this section 
0<a<b), (a,b) =1, P= (a-—1)(6-1 


Since N,(a, 6) = P (1, p. 124), the equation ax + by = n is solvable in non- 
negative integers for all m > P. The non-negative solution with smallest x 
is denoted by x,, y,. If x, y is any non-negative solution of ax + by = n then 
XY D Xe t Vn: 
Also 
Xn+d + Vn+bd = Xe + Vn + & 
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Since (a, 5) = 1, the numbers aa for 0 < a < 6 — 1 constitute a complete 


system of residues modulo 6. Hence n = aa(mod b) has a unique solution with 
0 <a < b — 1. Denote this solution by a,. 


LEMMA 1. xX, = Gq, Yn = (m — da,)/b. 
Proof. Clearly 
da, + b((nm — da,)/b) = n. 
Hence all solutions of ax + by = n are given by 
a, + bt, (n — aa,)/b — at. 
For non-negative solutions we must have 
t>—a,/b> —1. 
Hence all non-negative solutions have x > a,. 


LEMMA 2. If n>P, a =b—1 and 1<c<qn—P then 
Xu tT In D> Same TF Yao T | 


Proof. We have 
Xn + Vn = Oy + (mn — aa,)/b = (n + a,(b — a))/b 


= (n+ (b — 1)(b —a))/b > (n —c + a,_.(b — a))/b 
Xn—c + Va—e- 


LEMMA 3. 


max (xX, + Yn) < 6 — 2 + [m/b] 


neK 

where K consists of those n satisfying P Cn < P+m—1,m>1. 

Proof. Let B be the smallest integer greater than P + m — 1 which has 
ap = 6 — 1. Then 

B>P+m-—1>n 
and therefore B — n > 1. Also B — n < B — P. Hence taking the n, c of 
Lemma 2 to be B, B — n respectively, we see that 
Xp + Ye > Xp—(p—n) + Va—(e—n) + 1 = Xn + Yn + I 

This is true for all » € K, so 


max (Xn + Yn) < Xp + ye -—ji= (B + ap(b — a))/b — 1. 


neK 
For arbitrary m we have 
B= P+ [m/b)b—1+4+5. 
Hence 


max (x, + yn) < (P + [m/b])b —1+ 64+ (6—1)(6—a))/b-1 
neK = b a 2 = [m /b). 














wh 














ON A DIOPHANTINE PROBLEM 2: 


tw 


2. The main theorem. 


THEOREM 1. N,(m,m + a,m + b) < m(b — 2 + [m/b]) + (a — 1)(6—1) 
where 0 <a <b, (a, b) = 1, m > 2. 
Proof. Let P, K, x», ¥, be as in §1. Define 
Q = max (x, + a). 


nek 


Now define Z,, §,, 2; for 7 > P by the following : 


Ri 


i= Xy 9; = ¥: for j = 7 (mod m), P<i¢cP+m-—l1, 
,=Q+(G — P)/m) — Z; — §;. 


NI 


Then 
mz, + (m + a)é; + (m + b)G; 


(2) m(2; + 2; + §;) + at, + bg, 
= mQ + m([(j — P)/m] + az, + b9;. 
As 7 runs over 
P+sm, P+ sm+1,...,P+sm+m-— 1 
for s > 0, the right side of this equation runs over 
mQ0+ms+P, m+ms+P4+1,...,mQ+ms+P+m —1. 

Hence as j runs over the integers greater than or equal to P and s runs over the 
integers greater than or equal to 0, the left side of (2) runs over the integers 
greater than or equal to P + mQ. Replacing Q by its upper bound from 
Lemma 3 now gives the desired result. 

Essentially the same proof yields the following : 

THEOREM 2. N,(m, m+ a,,...,m +a.) < P + mQ where a;, ao,... , a, 
are relatively prime positive integers, 


P = N,i(q,...,@e), Q = max (Xin +... + Xen), 


neK 


K is the set of n such that P Cn <P +m —1, and Xiy,..., Xen 18 @ non- 
negative solution of (1) with smallest sum. 


3. Special cases. Let Ni(m, m + a, m + 6) denote the upper bound in 
Theorem 1. We compare N, with N, in a few special cases. 
(a) By the main result in (4; see also 2, Theorem 7, p. 310), 
Ni(m, m + 1, m + 2) = m[}m] = N,(m, m + 1, m + 2). 
(b) By a result stated in (4), 


Ni(m,m + 1,m + 5) = Ni(m,m + 1,m + 5d) 
for 


m=-—1(modbd), m>b*? — 5b+3; 
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and 


Ni(m, m + 1,m + b) = Ni(m, m + 1, m + 6) — (m — b[m/b]}) 
for 
m = — 1 (mod 9), m > b? — 4b + 2. 
(c) By direct evaluation it is not difficult to show 
N,(m, m + 2,m + 3) = N,(m, m + 2, m + 3). 
(d) By computation for 2 < m < 16 we find 
N,(m, m + 2,m + 5) = Ni(m, m + 2, m + 5) 
for 


5, 8, 10, 13, 14, 15 


3 
il 


and not for 


3 
II 


2, 3, 4, 6, 7, 9, 11, 12, 16. 
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SETS AND SUBSERIES 


G. M. PETERSEN 


Suppose a, ~ 0 for all m. By replacing a set of terms from the series }-a, 
by 0’s, we form a subseries of the original series. These subseries can be put 
in one to one correspondence with the non-terminating binary expansion of 
the points on the real line segment (0, 1). The point & = .b,:b....05,... shall 
correspond to a subseries if and only if 6, = 0 whenever the mth term of the 
original series has been replaced by 0 and b, = 1 whenever the mth term is 
retained. We can now speak of sets of subseries of the first category, measure 
zero, etc. 

If we have a series }-a, whose terms {a,} form a positive monotone decreasing 
sequence, 4; > d2 >... >a, >..., then it is easy to show by a Dedekind 
section the existence of an index p such that }-a,* converges for g > p and 
diverges for g < p. The object of this note is to show that if the terms of a 
series are a positive monotone decreasing sequence, the set of subseries that 
have an index differing from the original series is of the first category. 

We first prove a lemma. 


LemMMA. If {c,} is @ positive monotone decreasing sequence, > c, diverges, 
lim sup nc, * 0, then the set of subseries that converges is of the first category. 


Proof. We can assume that 2(m)Conim > k where {n(m)} is a sub- 
sequence of {m}. However, this implies (m)Caim > 32 and vc, > $k for 
n(m) < v < 2n(m). We can associate with this sequence a regular summation 
method A, defined by 

1 2 n(m) 

a n(m) Sin)’ 

Suppose now that c’, = c, if the term is retained for the subseries and c’, = 0 
otherwise. For any subseries }°c’,, we shall choose a sequence of 0's and I's, 
{s’,}, so that s’, = 1 if c’, #0 and s’, = 0 otherwise. There is an evident 
correspondence between the sequence {s’,} and the point corresponding to 
the subseries. We now see that if }-c’, converges, {s’,} must be A summable 
to 0. For if tn, 2 A> 0 for an infinite set {yu}, then 


2 n(m,) k 
, es > r n(m,) C2n(m,) > d 9 
v=n(m,) 7 


and the series diverges. 
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However, it has been shown by Hill that the set of points on (0, 1) in the 
binary expansion corresponding to such a set of 0’s and 1’s is of the first 
category. 

We are now ready to prove our theorem. 


THEOREM. Jf {a,} is a positive monotone decreasing sequence, the set of 
subseries of >a, with a different index is of the first category. 


Proof. The index for the series }>a, may be assumed to be 1. For if the 
index is p, >-a,” will have an index 1 and the set with a different index will be 
the same for }-a, and ¥a,?’. 

If r < 1, then lim sup n a,” > 0, for if 

lim na,’ = 0, 
then a, = O(m-"") and for r < g < 1, >a," would converge, contrary to our 
assumption. Hence the set of subseries of }-a,” that converges for any r is of 
the first category. 

If a subseries has an index less than 1, then it will belong to the set £, of 
convergent subseries of 

TT ad vy = 2,3, 


for some v. The set of all subseries with index less than 1 will be contained in 
the union of these sets and so will be of the first category. 
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SHORT PROOF OF A MAP-COLOUR THEOREM 
G. A. DIRAC 


Heawood (3) showed that for h > 1 the countries of any map on a surface 
of connectivity 4 = 3 — x can be coloured using at most 
nm, = [3(7 + (24h — 23))] 


colours. In a previous paper (1) I proved the following 


THEOREM. For h = 3 and for h > 5 a map on a surface of connectivity h 
with chromatic number n, always contains n, mutually adjacent countries. 


A new short proof of this will now be given. It is based on the following 
result (2): 


If a critical k-chromatic graph contains N > k nodes and E edges then 
(1) 2E> (k-1)N+k —3. 


(A graph is called critical if deleting any arbitrary node or edge reduces the 
chromatic number. A k-chromatic graph always contains a critical k-chromatic 
subgraph, and a critical graph is finite and connected (1, p. 481).) 

To prove the above map-colour theorem it is sufficient to show that for 
h = 3 and hk > 5, if a critical k-chromatic graph with more than k nodes is 
drawn without intersection of edges on a surface of connectivity hf, then 
k < m, — 1. Let such a graph have N nodes and E edges. By (1) and Euler's 
Theorem (1, 2.1), we have 
(2) (k-—1)N+k-—3 <¢ 2E < 6N + 6(h — 3). 

Further (1, 1.3), NV > k + 2. 

For the case when h = 3, we observe that m; = 7 and, from (2), k < 6. 

For h > 5 it may be assumed that k > 6, so it follows from (2) that 
(k — 7)(kR +2) < (Rk —7) N < Gh — 15 — , 
whence 
k<2+ [V(6h + 3)). 

Thus k < m, — 1 when 

[V/ (6h + 3)] < [$1 + V/(24h — 23))). 
This is clearly true if 

Vv (6h + 3) < 4(1 + V/(24h — 23)), 

that is, if k > 13. It can be verified also for h = 5, 6,..., 12. This completes 
the proof of the Theorem. 


Received December 31, 1956. 
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CORRECTION TO “A MINIMUM-MAXIMUM PROBLEM 
FOR DIFFERENTIAL EXPRESSIONS” 


D. S. CARTER 


The author takes this opportunity to correct some misprints and to add a 
note to his paper “‘A minimum-maximum problem for differential expressions,” 
in this Journal, 9 (1957), 132-140. 


Page 134, Equation (2.5): for this equation read 

|| Vxo |] = inf { || Vx || | reX}. 
Page 137, line —5: for “eo” read “‘e*”’. 
Page 138, line —7: for “‘|£9|" read “|to|’’. 

Added note: Since the preparation of this manuscript it has come to the 
author's attention that the present problem bears a close relationship to the 
“Bang-Bang”’ control problem (3). Choosing c = 0, », = 0, and allowing the 
endpoint 6 to vary, it is easy to show that the value of ||go|| at the solution 


is a continuous monotone function of b. The value of b for which ||go|| = 1 
provides the solution to a ‘“‘Bang-Bang’’ problem of a rather general type. 
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UPPER AND LOWER BOUNDS FOR THE AREA 
OF A TRIANGLE FOR WHOSE SIDES 
TWO SYMMETRIC FUNCTIONS ARE KNOWN 


ROBERT FRUCHT 


1. Introduction. Improving on inequalities given by Gerretsen (2), 
Beatty (1) has proved that for the area A of any plane triangle with sides 
a, b, c the following inequalities hold: 


(K —H)’~ 2. (K —H)(3K — 5H) 


(1.1) 2 > A’ > a 
where 
(1.2) H = }(2°+0?+c?), K=be+ca+ab; 


the signs of equality in (1.1) only apply when the triangle is equilateral. 
Beatty has also remarked that the second inequality in (1.1) is of no value 
in case 5H > 3K, since then the lower estimate which it gives for A? is not 
even positive. 

As an improvement on the foregoing estimates, a proof of the following 
inequalities will be given here (§2): 

2 2 
s(s — q)°(s + 2q) s(s + gq) (s — 2g) 

(1.3) a > Sp ae 
where 
(1.4) q= (a? + b? + c? — bc — ca — abd)!, 
i.e. 
(1.5) q = (4Q)', 
where 


(1.6) Q = (b — c)? + (c — a)? + (a — DB)? 


is the measure of ‘“‘unequilaterality’’ already considered by Gerretsen (2); 
and 
(1.7) oe atb+c_ (#28) 

2 2 
is the semiperimeter of the triangle.' 

The first (second) equality sign in (1.3) holds for isosceles triangles whose 
base is the smallest (largest) of the three sides; of course both equality signs 
apply when the triangle is equilateral, since then g = 0. 

Recieved April 20, 1956 

'The use of s and g as the given symmetric functions of the sides throughout the present 


note (instead of Beatty's H and K) seems justified by the fact that it simplifies the formulation 
and proof of (1.3). 
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In our (1.3), as in Beatty’s (1.1), the second inequality is useless when 
5H > 3K, i.e. when g > }s. It is easily seen that in this case triangles can be 
found with as small an area as one might wish; hence no other lower bound 
for A can then exist than the trivial one: A > 0. 

Apart from this exceptional case (and that of an equilateral triangle), 
(1.3) represents an improvement over (1.1), which might be rewritten in terms 
of s and q as follows: 


(1.8) ( q )? > A’ > (s° mea gq) (s° ee 4q°) ; 


2 2 
Ss — 
27 27 


by using the easily verified formulae 





(1.9)- H = 3(2s?+q%), K = 4(4s? — q?). 


Indeed, Beatty’s upper bound in (1.8) is higher than ours in (1.3), since the 
difference between them is 





2 2,2 2 P 2 _ 2 
(1.10) a). = = ars +29) _4 c— a. Ae. 


o aé vA 


analogously from 


= @ 





s(s + 9)*(s— 2g) _ (s*—q°)(s* — 4q*) _ 2¢°(s + 
(1.11) ot 4 


27 


- 


(q < $s) it follows that our lower bound in (1.3) is higher than Beatty's in 
(1.8). 

Incidentally it might be remarked that a new simple proof of (1.1) is thus 
obtained by deriving (1.8) from (1.3). 

Moreover, it can be shown that in a certain sense our estimates (1.3) are 
best possible; indeed, the bounds given by (1.3) for A? are nothing else than 
the extremal values that can be attained by A? considered as a function of 
the three positive variables a, b, c subject to the constraining relations (1.4 
and (1.7) and furthermore to the obvious inequalities 


(1.12) b+c>a, c+a>d>b, at+db>c, 
equivalent to 
(1.13) ets, 24, €€h 


Of course, the given values of s and g (besides being positive) have to 
satisfy the inequality 


(1.14) q<s 


(equivalent to Beatty’s H < K) as a consequence of the easily verified 
identity 


2 
(1.15) gz=s'— 3( ac + #) . 


loa 


- =—<— «-«- 
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hen To prove the foregoing statements it would only be necessary to find these 
. be extremal values of A? by standard methods of differential calculus, although 
und \ some caution is necessary because of the inequalities (1.13).? It seems, however, 
that the proof of (1.3) given in §2 avoids this difficulty, and at the same time 
le), has the advantage of being purely algebraic. (The use of trigonometric func- 
rms tions is not essential, as will be pointed out at the end of §2.) 
A final remark (§3) will be devoted to the problem of finding bounds for the 
product of the three sides of a triangle. 


2. Proof of the inequalities (1.3). We start with the well known 
formulae (3, pp. 25-28) for the trigonometric solution of a reduced cubic 
equation 


(2.1) x?+3Cx+D=0 


the with real roots x, X2 


, X3 satisfying the condition 
(2.2) Xi + X2+ x; = 0, 
viz. (if the roots are numbered conveniently): 


| - 2./—C cos 0, x2 = 2./—Ccos(0 + 120°), 





(2.3) es 

\ | x3 = 2\/—Ccos(@ + 240°), 
) } 

where the auxiliary angle © is defined by 
adie, | (2.4) cos 36 = —43$D(—C)-**, 0< 06 < 60°. 
thus A possible choice fulfilling (2.2) is 

(2.5) ¥, =a—%s, x2=b— $s, x3 =e — 95; 

} are then 
than } (2.6) 3C = XoX3 + X9X1 + X1%2 
m ol 
(1.4) will be related to g—defined in (1.4)—by 

(2.7) Cc = = q?/9 

since 

2_ (6—c) + (c~a) + (@—b) _ (2 — %s) + (%s — ¥1) + (1 — %2) 

™™ 2 . 2 

= (xy Xe oa x;)° = 3 (xox a XgX\ oo X 1X2) = 9C. 

re to Hence (2.3) goes over into 

(2.8) x, = 3qgcos0, x. = 3qcos(@ + 120°), x; = }¢cos(6 + 240°), 

*Such a proof has been suggested by Beatty (in a letter to the editor): Since for a maximum 
rified % © minimum of A? in the sense of differential calculus the jacobian 
0(s, H, A*) 
d(a, b, c) 


must vanish, from (1.2), (1.7) and (1.15) it would follow rather immediately that the triangle 
is isosceles. 
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and from (2.4) and (2.7) it follows that 
(2.9) D=- s q‘cos 38. 


Let us now express A? as a function of g, s, and @. We have: 


2 
(2.10) = = (s—a)(s — b)(s —c) = (5 — ns)(5 _ w)(§ _ xs) , 


and from the identity 
(2.11) (x — x1)(x — x2)(x — x3) = x*° +3Cx + D 
it follows for x = 4s that 


(2.12) ff tseres 


finally, using the values of C and D given by (2.7) and (2.9), we are led to the 
formula*® 


(2.13) A’ = 57(s" — 3sq’ — 2q*cos 30). 
From (2.13) it is now obvious that the maximum of A? is 


(s — g)*(s + 29) 


27 





(2.14) 57(s* — 3sq° + 29°) = ° 
and it is really attained by taking 6 = 60°, i.e.—see (2.5) and (2.8)—for an 
isosceles triangle with sides 


9 y om 
(2.15) = hs @ b = 2% q) 


The minimum of A?, corresponding to the second equality sign in (1.3), 
viz. 





: s(s* — 3sq° — 2g°) _ s(s +.g)*(s — 29) 
is obtained by taking 6 = 0 in (2.13), i.e. for an isosceles triangle with sides 
2s — 
(2.17) a= as fa) , boca =, 


if that triangle exists, i.e., if the conditions (1.13) are fulfilled. From (2.17) it is 
immediately seen that this is only the case if g < 4s; but it has already been 
mentioned above (§1) that there is no positive minimum of A? in the excep- 
tional case g > }s (i.e., 5H > 3K in Beatty's notation). 

Finally it should be remarked that the use of trigonometric functions in the 


*Professor Gordon Pall has pointed out that the following more general formula can be 
proved along the same lines: 


(k — a)(k — b)(k—c) = 3¢*(p* — ip — } cos 30), where p = (3k — 2s)/2¢. 


ww 


1e 


An 
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foregoing proof might be avoided by proving instead of (2.13) the following 
algebraic identity: 


2 2 2 
(2.18) 1a" - ls" ~ 3q°)} = (+) 149° — 27(b — c)*(c — a)*(a — 6)" 


and obtaining thus the following inequality equivalent to (1.3): 


> (5? — 39°) 


2.19 lat- = 
(2.19) at - = 





= 
S97 9: 


3. Bounds for the product abc. In virtue of (1.15), one might obtain 
from (1.3), or from (2.15) and (2.17), exact bounds also for the product 
abe: 

2 . 2 2 2 
(3.1) a7 (5 — 9) (2s + g) < abe < ars + g) (2s — 9) ; 
giving us the solution of the following algebraic problem which might be of 
some interest in itself: to find maximum and minimum of the product of 
three positive variables a, 5, c, if two of their symmetric functions, viz. s 
and g, are given. Since 
(3.2) qg? = 4s? — 3K, 


it is easy to obtain from (3.1) the exact bounds for abc also in the case when 
the first two elementary symmetric functions, viz. 2s and K, are given: 


9 ‘ . . 
(3.3) 24 s(0K — 8s) — (45° -— sx)" < abc 


2) 4 2 2 r\3/2 
< 97 S(9K — 8s°) + (4s° — 3K) [. 
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THE MINIMUM OF A 
DEFINITE TERNARY QUADRATIC FORM 


A, P. DEMPSTER 


A purely algebraic solution for the problem of the extreme definite ternary 
quadratic form has been given by Mordell (2). The equivalent geometric 
problem of the critical lattice of a sphere was solved by Mrs. Ollerenshaw (3) 
under a preliminary assumption that was said to be justified by “‘simple 
considerations” (p. 297). The present treatment eliminates this weakness, 
and avoids the use of trigonometric functions, without adding undue complica- 
tions. It arose from a remark of Hilbert and Cohn-Vossen (1, p. 45) which 
states that their method for the analogous two-dimensional problem (1, 
pp. 35-37) can be generalized to any number of dimensions. However, the 
generalization is not trivial, even in three dimensions, and the four-dimensional 
treatment of Mrs. Ollerenshaw (4) involves a preliminary assumption analogous 
to that mentioned above. 

We begin by briefly recapitulating Gauss’s statement of the connection 
between the algebraic and geometric problems. Three independent vectors 
t, (r = 1,2,3) generate a system of vectors u,t, (u, integers) which lead 
from the origin to the points of a lattice. We may regard (1, u2, u3) as affine 
coordinates for the lattice points. The square of the distance from (0, 0, 0) to 
(141, U2, U3) is 


(>> u,t,)? = D>. Ars thy He, 


where a,, = t,-t,. Thus the lattice determines a positive definite ternary 
quadratic form whose minimum m is the square of the minimum distance c 
between lattice points. If the vector t, has Cartesian components (x,, y,, 2,), 
the determinant of the form is 





D = det (t,-t,) = det (x, x, + y, ¥s + 2, 2s) 
| x1 ya 21| | xa x2 xs| 
= |X Y2 Z2| | V1 Y2 | = V’, 
|X3 ¥3 23| | 21 22 23 





where V is the volume of the elementary cell (or primitive parallelepiped) 
of the lattice. Since 


the form of smallest determinant for its minimum value corresponds to the 


lattice of smallest cell for its minimum distance. Such a lattice we now seek. 
Received February 11, 1954. 
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Consider any point A of a given lattice whose elementary cell has volume IV. 
Choose a lattice point B at the minimum distance c from A, and a lattice 
point C outside the line AB, at the shortest distance 6 (>c) from A. These 
points can always be chosen so that ZCAB < }2, and the sides a = BC, 
b = CA, c = AB of the triangle ABC satisfy 


a>b>c. @eh?+c’. 
Let A and R denote the area and circumradius of this triangle, so that 
164A? = — at — bt — c4 + 2b%c? + 2c%a? + 2a*d?, $RA = abc. 


In a parallel lattice plane nearest to the plane ABC, there is a lattice point 
D whose orthogonal projection D,; on the plane ABC lies inside or on a side 
of the parallelogram ABA'C, and so, by choice of notation, inside or on a side 
of the triangle ABC. Denote by d the distance DD, from D to the plane ABC, 
so that 

V = 2Ad. 


Since none of AD, BD, CD is parallel to AB, all of them are greater than or 
equal to b. Since the triangle ABC has no obtuse angle, circles of radius K 
with centres at the vertices overlap in such a way that every point of the 
triangle except the circumcentre is inside at least one of them (3, p. 298, 
footnote). Therefore the distance of D, from at least one vertex is less than RX, 
except that it is equal to R when D, is the circumcentre. Thus at least one 
of AD, BD, CD is less than or equal to (R* + d*)!, and consequently 


b? < R? + d’, 
wit’ equality only when D, is the circumcentre. Hence 
V? = (2Ad)? > 4A?(b? — R?) 
= 1 b?(—a*t — bt — ct + 2b%c? + c*a*® + 2a*d*) 
= 4c + 4 ¢2(b? — c*) (3b? + 2c?) + 3 b2(a? — Bb?) (6b? +c? — a 


4 c8 


Vv 


with equality only when 
d? = 6? — R*, 5b =<, 
and either 
(i)a=b or (ii) 6? + c? = a’. 
These conditions (i) and (ii) are sufficient to determine two ‘‘critical’’ lattices 
A, and A, for either of which D/m? or (V/c*)? attains its minimum value }. 
In case (i), the tetrahedron ABCD is regular, and we may choose Cartesian 


coordinates 
oe, Cet th CO Eh CE, 1,9 


for its vertices. Thus 
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t, = AB = (0,1, 1), t. = AC = (1,0, 1), t; = AD = (1, 1,0) 


and 
(dou ,t,)? = (ue + us, U3 + My, Uy + U2)? 
= (us + U3)? + (us + 1)? + (1 + Ue)? 
= 2(u,?7 + uo? + U3? + ets + Ug ty + Uy Ue). 


In case (ii), the triangle ABC is right-angled isosceles. Choosing A, B and D 
as before, we now have C at (0, 1, —1). Thus 


t, = AB = (0,1, 1), t. = AC = (0,1, —1), t; = AD = (1, 1, 0) 
and 
(Sou, ty)? = (us, uy + te + U3, Uy — Ue)? 


= uz? + (uy + Ue + U3)" + (ui — U2)? 
= 2(u,? + uo? + uz? + Ue Uz + Uz &)). 


In either case, the lattice generated by the t’s consists of all points with 
integral coordinates whose sum is even. Thus the lattices A; and Ags are the 
same, viz., the face-centred cubic lattice (1, pp. 46, 56). (The points whose 
coordinates are all even form an ordinary cubic lattice, while those which 
have two odd coordinates are the centres of the square faces.) Since the two 
sets of t’s are different bases for the same lattice, the two forms are equivalent: 
there is only one class of extreme ternary forms. 
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THE PERFECT AND EXTREME SENARY FORMS 


E. S. 


BARNES 


This paper is devoted partly to a summary of some results which will be 
published in detail elsewhere,' partly to a description of a new extreme 
senary form, $s, and an n-dimensional generalization of it. 

The main result is the determination of all classes of extreme senary forms. 


There are just six classes, represented by: 


: + _ X Xj, 


6 
do = Dox 
1 


i<j 
91 = Go — X1X2, 
G2 = do — XiX2 — X1X3, 
3 = oo — (x 4X2 + X3X4 + X5X¢), 
bs = bo — B(xyxXe + XaX4 + XaX%5 + Nave + X4X5 + XaX6 + X5%o), 
ds = do — §(Qxixe + XiXs + XiX6 + Nos + X4Xq + BW5ro). 


Each of these forms has minimum M = | for integral x # 0. They are 
listed in the following table, together with the symbol used by Coxeter (3), 
the order of the group g of automorphs, the number s of pairs of minimal 
vectors, and the value of 26D/M® in decreasing order of this quantity. 


Coxeter’s 

Form symbol Order of g S 2°D/M® 
do As 2.7! 21 7 
3 96 21 13.3°/2° 
or 672 21 nd Fy 
>: Bs in Ds; 2°.6! 30 } 
os ay 144.6! 27 35 /2° 
do: Es 144.6! 36 3 


Thus there is just one class of absolutely extreme forms, represented by 
¢2; and, in agreement with Blichfeldt (2), 
6 
£ M 
Ys = max — 
f 


D~ Die) 3 
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The method used to establish these forms is the algorithm for perfect 
forms given by Voronoi (6). Hence all classes of perfect senary forms are now 
determined; these are represented by the extreme forms @o, . . . , ds, ds and 


bs = do — 3 (xX. + X3X4q + XaX5 + X 4X5). 


The forms ¢o, $1, ¢: and ¢, were found by Coxeter (3), who gives a full 
description of their properties. Previously, both Voronoi (6) and Hofreiter 
(4) had established ¢o, ¢; and ¢2 (this being Voronoi’s notation for them). 
As is pointed out by Coxeter (3), Hofreiter’s list contains a fourth form, F,, 
which is not extreme, but which is presumably an arithmetical slip for? ¢3. 

The form ¢;3 was recently discovered independently by Kneser (5) and the 
author (1), and the reader is referred to these articles for a full description of 
its properties. 

The remaining extreme form @¢. and the perfect non-extreme form 5 are 
new. Apart from its existence as the simplest known form of its type, $5 has 
little structural interest; we merely note that it has 


M=1, D= 34/2", s = 22 


and that its group g is of order 288. 


On applying to ¢6(x) the transformation 


3 6 2 5 l 4 
5 3 ] 6 4 2 
4 l § 2-1 3 
x=7| -4-1 2-2 1-3 |” 
—1 -—-2 -—3 -—4 2 l 
—-6 -—5 —4 -3 -2 1 
we obtain 
6 
(2) 4g6(x) = 20 vi + 2D. yy = 2oo(y); 
1 i<j 
or, defining y7 by the relation 
(3) Ly = 9, 
1 
(4) bés(x) = > yi. 
1 


Since 2¢) has determinant 7, and the transformation (1) has determinant 
1/7, it follows that D(¢s) = 7*/2'*. It is also easily verified from (1) that x 
is integral if and only if y is integral and satisfies 


‘ 


(5) p> ty; = 0 (mod 7). 
1 





2As Professor Coxeter has remarked to me, the correct value of 26D/M® for ¢; is obtained by 
replacing 53 by 52 in Fy. 
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Thus ¢— now appears as the case » = 6 of the form f, defined, in terms of 
its lattice, by 


(6) fa = >> Yi 
where the y, take integral values subject to 

(7) » y= 0, 

(8) > iy, = 0 (mod m) 


(where we have set m = n + 1 for convenience). This form is extreme for all 
n > 6, but the proof of this for odd m is rather involved. In what follows we 
shall restrict ourselves to even m (odd m). 

Let @;,...,@, denote the unit vectors in m-space, with coordinates 
V1). ++» ¥m- Now f, takes only even values subject to (7). It takes the value 
2 only when y = e, — e,, and clearly none of these sets satisfies (8). It takes 
the value 4 when 
(9) y=@,+@, — @. — & 
with 


(10) a+b6=c+d(modm). 


Thus M(f,) = 4. Since the lattice defined by (7), (8) has determinant 
m'. m, we have D(f,) = m®*; so 


(2/M)"D = (nm + 1)8/2". 


The number of pairs of minimal vectors is* 


s = £n(n + 1)(n — 2) (m even). 


For, for any a, 6, the number of distinct unordered pairs c, d satisfying (10) 
is $(m — 1), of which one pair is a, b. Since a, b may be chosen in 4m(m — 1) 
ways, this gives 2s = }m(m — 1)$(m — 3), as asserted. 

We now show that f, is eutactic. Regarding y,, as the redundant variable 
in (6), we have 


n 2 n 
t= (¥9.) + Lavi, 
with adjoint 


Fr=ndiyi-22 yor. 
i<j 


The linear forms associated with the minimal vectors (9) are then 


(11) Ae = Va + Vo — Ve — Va (k = 1,...,5), 


For odd n, s = 4 (n — 1)* (m + 1). 
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with y,, = 0. We now see that 
(12) 3(m — 3)F, = Dod. 
1 


For, by symmetry, each y; occurs in 4s/m = 4(m — 1)(m — 3) forms ),, 

giving a coefficient $(m — 1)(m — 3) of y? in >\,”. A pair y;, y, occurs with 

like signs in $(m — 3) forms (as above); and with opposite signs in m — 3 

forms, this being the number of solutions of d — 6 = i — j with d, b distinct 

from i and j. This gives a coefficient m — 3 — 2(m — 3) = —(m — 3) of 

yd, in >),*. Collecting these results and setting y,, = 0, we obtain (12). 
To prove that f, is perfect, we begin by defining the vectors 


f = Oy + Cip1 — Cryo — Cyr (i =] 


where all suffixes and congruences are to be interpreted modulo m. Then 
m 
1 


but any m — 1 of the f; are independent. Hence we may take a new coordinate 
system 21,..., z, defined by 


n 


Z= Dadi. 


We have now to show that if all minimal vectors z of f, satisfy a quadratic 
relation 


n 
(14) p(z) = 2 pies = 0, 
then all p,; = 0. Using (12), we shall express some of the minimal vectors in 
terms of f;,...,f,; if f,, occurs in the expression, we must of course replace 
it by 
n 
-zf, 


1 
before using (14). 
(i) Suppose first that m > 9 (and odd). We use in (14) the minimal vectors 


(15) f, + figs + see + fi, = Cis, + Ci41 = Citr+2 = Bnd 
(ry # + 1, —3) 
(16) fit+fuet+...+ fue =e,+ Citergs — Crporge — Crs 
(2r # — 1, —2, —3), 
(17) —f,; + f, + f iss + see + f ism—s = C16 + Cup — Cy_-3s — Ci-2, 
the given conditions ensuring that the suffixes on the right are distinct. 


Values of 7 giving a suffix 0(mod m) on the left will not be considered. 
We obtain at once from (15), with r = 0, 


| 


(18) p(f,) = pu = 


an 


Nc 


Th 
Pi 
the 


—— 
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Using (15) with indices r, r — 1, and subtracting, we obtain 


2P 1.147 + 2P i415 i+r + see + 2P 47-1, wer Pitr +r = 0; 


hence, from (18), 


(19) Pitter H Piss, t4r H+ H Pieri, er = O Cf. > ae 


Replacing 1, r by 7 + 1,r — 1 and subtracting from (19) gives 

(20) Pii+r = 0 (ry = 4 
From (16) with r = 1, and p,, = 0, 

(21) Pi.i42 = 0, Pim = 0; 


from (15) with r = 2 and (21), 


(22) Pi i+ + DP i+1, ae > 0. 
From (19) with r = 3 and (21) 
(23) Pi,i+s + P i+2) os = 0. 


geese 
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We now use (17), and (15) with r = m — 6; subtracting the results gives 


Pi-3,1 + Pi 3, i+1 + eee + py 3, i¢+m—6 = 0. 


This holds certainly for i = 4 and i = 1 (no suffix then being zero), whence 


Pia + Pis +... + Pim—2 = O, 
Pi .m-2 + P2.m-2 + ee + Pm 5, m—-2 = 0. 


By (20), these reduce to 
Pis + Pi.m-2 = Pi.m—2 + Pm-s, a-g = 0, 
whence 
Pis = Pu—s, m—2- 


By (23) with 7 = 1 and i = m — 5, this gives 


(24) Pas = Pm—s, m-2- 
But (22) gives 


Pa = — Pas = Poe =... = — Pm-s, m-2; 
and so, with (24), 
Psa = Pm-s, m-2 = 0. 
Now (22) and (23) give at once 


Ps.svs = 0, Pius = 0. 


This, with (18), (20) and (21), establishes that all p,, = 0 except possibly 
Pim—2 and peo m-1. That these are also zero now follows immediately by using 


the minimal vectors (from (15) with r = 1) 
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— (f. + f:) fi+f;+f4,+...+fn-1, 
— (f,,-2 + f,) f,+f.+ see + fres + fn: . 
Thus all p,, = 0, and so f, is perfect. 
(ii) When m = 7, (17) is no longer a minimal vector and the above analysis 
fails. All 21 vectors are now 


ll 


(25) f.,f¢ + fire, £6 + fii + frre (4 —_ I: ict 
and a simple calculation establishes the perfection of f, = ds. 
It is easy to verify that f,,... , f,,1 give not only a basis of the space, but 


also a basis for the lattice of f,. Hence we find the cyclically symmetrical 
representation 


f.(x) = , (4 + X41 — Xe-2 — X41) (n even), 
1 


with x, = 0, the suffixes being taken modulo m. A similar representation 
exists also for n odd. 

We now consider the group g of automorphs of ¢.; we denote by @ the 
contragadient group of automorphs of its adjoint Fs. Since g must permute 
the minimal vectors (more precisely, the pairs +m of minimal vectors), 
® permutes the associated linear forms. From (11), any permutation of the 
d, leaves F, invariant. It follows that © is precisely the group of linear trans- 
formations permuting the associated linear forms. 

From this result, we shall show that @ (and so q) has order 672, is transitive 
on the linear forms,‘ and is isomorphic to ©, K PGL(2, 7). Corresponding to 
the representation (25), we may write the associated linear forms (11) as 

Zin Zi H Sin2, 24 + Fini + Zero ‘fg > ere 
where 
Ze = Vi t+ Ver — V2 — Vi-r, 


> z, = 0. 
1 


Obvious elements of & are S: y; — yi4;1 and U: y; — ys; (where the suffixes 
are taken modulo 7). These show that @ is transitive on the linear forms. 
For Sz; = 2:41, so that the forms of each set (z;), (2; + 2:42), (2: + 21422 
are permuted cyclically by S; while 

Uzz = yr + Ys — Yo — Ya = 27 + 22, 
Uz, = v1 + V2 — Va — Vs = — (23 + 24 + 25). 

Let now @; be the subgroup of G which leaves 2; invariant (or changes 
its sign). We shall say that two of the linear forms combine if their sum or 
difference is an associated linear form. This property being invariant under 
@, G; must permute the 8 forms combining with 27, namely 
(26) 2:+22+23, —25 —27, 2 +21, —27 —2:, 

Ze +25 +26, 2%, —26—27—21, 22. 


‘Contrary to what one might expect from (12), G is no longer transitive for n > 6. 


a 


Us 


we 


res 
or 
ler 


a 


EE 


—_—_——_—_— oo 


EXTREME SENARY FORMS 241 


Using the element T of G;, of order 8, defined by 
T(z, gpee50es 27) te (—27 ==§, Ge, 
21 + 22 + 23, —Z3 — 25, 25 + 26 + 27, —Z6 — 27 — 21, 23, 27) 


we see that G;, considered as a permutation group on the set (26), is transitive 
on this set. The successive elements T" 2.(m = 1,..., 8) are in fact the 8 
elements (26), in the order written. 

Let then G;,. be the subgroup of G; which leaves z, invariant. (We note that 
@;,. must leave the relative signs of 27, z. unaltered, since z; + 2 is a linear 
form, while z; — z; is not.) G7. must permute the 2 forms 25 + 27 + 2, 
2; + 22 + 23 which combine with both 2; and 22. A direct argument now shows 
that G;. is of order 4 and consists of the elements +I, +V, where 


V(zi,...,27) = (—27 —2; —22, &, —2s, —Z2 —24, —25 —22, —Zs, 27). 


It follows that @ has order 21.8.4 = 672, as asserted, and is generated by 
S, T, U, V and —I. Since 


V = -S'T'S, U = —T°S'T’, 


we see that G@ = {S, T, —I} = {-—I} X {S, T}. 

We may identify {S,T} with PGL(2,7), consisting of the regular 2 K 2 
matrices over GF(7) (where multiples of the same matrix are identified), by 
means of the mapping 


For, as is well known, PSL(2,7) (consisting of those elements of PGL(2, 7) 
whose determinant is a square) is a simple group of order 168 generated by o 


and 
ae “s) 
p= = (2 0/’ 


and may be defined by the relations 
(27) o” = @ = (¢f)' = (co) = 1. 


Now r+ has determinant 5, a non-square, so that o,7 generate PGL(2, 7); 
we may take the further defining relation as 


(28) ; tor = oc? 60°. 


A simple calculation shows that the relations (27), (28) are satisfied if ¢, r, @ 
are replaced by S, T, T‘ respectively. We note that, under the given mapping, 
the elements of PSL(2, 7) correspond to the elements of G of determinant +1 
(expressed as integral unimodular transformations on, say, 21, . . . , 26). 
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We consider finally a conjecture made by Coxeter (3, p. 392), that every 
eutactic form with s > 3n(n + 1) is perfect, and so extreme (the condition 
s > $n(nm + 1) being of course necessary for perfection). We can use the 
form f, considered above to show that the conjecture is unfortunately false. 

Let f(x1,...,%,) be any eutactic form with s > $n(2n + 1); f, is such a 
form for even n > 10 (in fact for all m > 10). Define 


g(X1,--- Xan) = f(X1, ~~. Xn) + Lng, ---, Xan). 


Then M(g) = M(f), and is attained when and only when one f is zero and 
the other assumes its minimum. Hence the number of pairs of minimal vectors 
of g is 2s > 4. 2n(2n + 1). Since f is eutactic and the adjoint of g is a multiple 
of F(x, ...,%_) + F(%as1,.--, Xn), g is clearly eutactic. But g is not perfect, 
since it is disconnected: all minimal vectors satisfy, for example, the relation 
X1X%—41 = 0. 

I am indebted to Dr. G. E. Wall for helpful discussions on the group of ¢s. 
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1. Introduction. In complex affine n-space with a unitary metric, a reflection 
is a congruent transformation leaving invariant all the points of a hyperplane. 
Thus the characteristic roots of a unitary reflection of period p consist of a 
primitive pth root of unity and m — 1 unities. A group generated by n reflec- 
tions is conveniently represented by a graph having a node for each generator 
and a branch for each pair of non-commutative generators. For a generator of 
period p, the node is generally marked p, but we find it convenient to omit 
the mark when p = 2, as in the case of real reflections (8, p. 619). Whenever 
two such involutory generators do not commute, the corresponding nodes are 
joined by a branch which is marked with the period of their product, except 
that for simplicity the mark is omitted when this period takes its most pre- 
valent value 3. 

Nodes representing commutative generators are not (directly) joined. 
This convention has the happy result that the graph for a (completely) 
reducible group consists of disconnected pieces representing the irreducible 
components. 

For any finite irreducible group generated by m involutory reflections, 
Shephard (29) and Todd (31) showed, in effect, that the generators may be 
so chosen that the graph either is a tree or contains just one triangular circuit. 
Since the abstract group defined by 
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Ri = R} = R} = (R2R;)" = (R3R,)" = (R,R.)" = E (L;, de, ls > 2) 


is infinite (32, pp. 26-29), the three generators represented by the nodes 
of such a circuit must satisfy some further relation. One of the purposes of the 
present paper is to indicate the advantages of using the relation 


(Ri R»R;R>.)” = E, 


which is equivalent to (R2R;R:R;)" = E or (R;R,R2R,)" = E if, as can 
always be arranged, at least two of the /’s take the value 3. Accordingly, we 
complete the graphical symbol by writing m inside the triangle. This agrees 
with the notation of Shephard (29, 370-374) when m = 3; for other values 
of m there is an essential difference, as we shall see. 

The finite irreducible groups generated by three real reflections are the 
symmetry groups of the Platonic solids (26a, pp. 20, 24). We shall exhibit all 
the remaining finite three-dimensional irreducible groups generated by three 
reflections as instances of [1 1 1']" (see 3.8), which is finite when / and m 
satisfy the inequality 3.7. In particular, [1 1 1*]*, which is ‘No. 24” in the list 
of Shephard and Todd (31, p. 301), is interesting because of its connection 
with the new senary extreme form discovered by Barnes (see the preceding 
paper (4)). 

Groups in more than three dimensions are derived by adding ‘‘tails’’ to the 
graph for [1 1 1']". The graph so obtained enables us to compute certain 
numbers m,, m2, ...,m, which quickly yield both the order of the group and 
the order of its centre. 

Reflections of period p > 2 will be discussed in a subsequent paper, where 
it will be shown that the proper interpretation for a branch marked 4 or 5 is 
not 


(R,R2)4 E or (R,R-)5 = E 


but 


R,R2RiRe R»Ri RR; or R,R2R,RR; = RoR, R2RiRo. 

2. Reflections in the coordinate hyperplanes. In complex affine n-space, 
with the “‘contravariant’”’ notation (x',...,x") for coordinates, any finite 
group of affine collineations (i.e., linear transformations) leaves invariant a 
positive definite Hermitian form, say 


2.1 > D> an x’ #* (¢n = dG; ;) 
(6, p. 253). This form determines a unitary metric in which the point 
(x?,...,x") may be usefully regarded as having also covariant coordinates 
(x1,...,%n,), defined by 

2.2 x; = DY an z (j= er - 
In this notation, the form 2.1 may be expressed either as 

2.3 >> x, x? 


| 
| 





———— 


—— _ ,— 


bo 
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or as 
2.4 > 2 6" x; 4s, 
where the new coefficients a* = a” are given by 
} a Gua” = 5}, 
so that 
2.5 w= Da*%, #= Da” x,. 

Embedding the affine m-space in a projective m-space, we may define an 
affine reflection (36, pp. 109, 115) to be a homology whose centre is a point 
at infinity, and define a unitary reflection (28, p. 82) to be an affine reflection 
that leaves invariant the form 2.1 or 2.3 or 2.4. 

For a group generated by n unitary reflections, we may choose such a frame 
of reference that the centres of the m homologies are the points at infinity on 
the contravariant axes. Then R,, the kth generating reflection, leaves invariant 


all the contravariant coordinates x’ except x*. Let us suppose that it trans- 
forms x* into 
i 
_ Cix. 


Since 2.1 is invariant, we have 
k gk ak k = 
Oy xB + DY (ay x’ B + ay, x* 2’) 
jek 
i = sf I = zl = i 
- Ox >, 61% Laz + = (CX Lait + ay; 2’ D0 cx ), 
jk 


whence, by comparing coefficients of x’ #* (j # k), 
2.6 Dye = Ann Cy Cy + Ay Ex. 


In the present paper we restrict consideration to cases where R, is involutory 
(i.e., of period 2), so that 


and therefore 2a, + a,c; = 0 (j # k). Since the vanishing of a, would 
imply a, = 0 for all 7, we must have a,, # 0 for each k. Thus 


2.7 7 20 jx / One G a k), 


and R, is the transformation leaving invariant every x’ except x*, which be- 
comes 


=x - =o 
Ox Or 


2.8 > cx’ = k 22 apex’ _ e_ 2h 
(cf. 17, p. 403). Thus R, ‘leaves invariant every point on the hyperplane 
x = 0. 


in other words, the reflecting hyperplanes are the covariant coordinate 
hyperplanes. 
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Expressing R, in terms of covariant coordinates, we find 


’ sl af =k 2X. 
= ¥= Dont” = Dont! +on(2 - 2) 
. isk Oke 
24 jp Xx 
=x,—-—#= 
Age 


(14, p. 182). Thus the matrix for the covariant transformation is the transpose 
of the matrix for the contravariant transformation (as it clearly must be for 
any transformation of period 2). 


3. Groups in unitary 3-space. In the case of the ternary Hermitian form 
3.1 x'Z! + x?Z? + x9Z? — 9 (x?Z? + x9 F? + x3Z! + x'Z* + cx! ZF + Ex*Z') 
(which is semidefinite when c = 1), we have 

ay = |, Qe; = a3; = — 3, ay. = — ke. 


The three reflections, expressed as matrices, are 


—|] é § l 0 O . & 
3.2 Ri = 0 ] O}, Re:=[c —-1 1}, Rs ={0 ] 0 
0 O I 0 O l I 1 —] 


We may easily verify that these three transformations R, are of period 2, 
while their products R.R; and R;R, are of period 3. Certain abstract definitions 
obtained by Shephard and Todd (31, p. 299) suggest that we may have a 
sufficient set of defining relations for the abstract group {R:, Re, Rs} as soon 
as we have specified also the periods of the products RiR» and R,R2R3R2. 
Since 


R,R.R;R.2 = R,R;RoR; and R»R;RiR; = RoRiR3Ry, 


the relation (R,R2R;R:)" = E can be replaced by (R:R;RiR;)" = E or by 
(R3RiR2R,)" = E. 

To find how the periods of R,;Rz and R,R:R;R»z depend on c, we observe 
that the characteristic equations for 


cé-—-1 -—-é€é+1 cé+c+é -1 —é 
RR. = Cc —| | ’ R,ReR;3Re = Cc + l 0-1 
0 0 1 c+ 1 —| 0 
are respectively 
(A— 1{A+1)?—cé&} =0, A—1I{A4+1)?— (C+ HD(Et DA} = 0. 


Comparing these with the characteristic equation 


(A 1)* - (205%) x =0 
~~ —s 


or 


by 
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for an ordinary rotation through 2x/p, we see that the reflections 3.2 satisfy 
the relations 


3.3 R,? = (R2R;)* = (R;R,)? = (R,Re)! — (RiReR;R2)” = E 


if 

3.4 cé = 4c0s’~, (c+ 1)(€+ 1) = 4cos’—, 
l m 

i.e., if c = 2 cos x/l-e*"*, where s is given by 


3.5 2 cos == (4 cos” — 4 cos’ = = 1) / (2 cos ") ; 
s m l l 


The most significant cases are worked out in Table | on page 269, where we 
use the abbreviations 


V5t1 | _-1+iv3 


9 ’ 2 
(so that r?> —7r —1=0, 0? +041 = 0). 


The form 3.1, having determinant 


Git Gi2 G13) 2-c —!I 
— 1 - , 
3.6 Qo, Gee Ge3| = & —C 2 —| 
G31 G32 33 —1 —1 2) 


lip 


= 91/5 — c€ — (c+ 1)(E+ 1)} 


if. 27 27 
= 15 — 4cos — — 4cos —], 
l m 


27 27 ~ 
4 cos 7 + 4cos — < 5, 


is positive definite if 


m 
1.e., if 
en 2r 2r 
3.7 2 cos — + 2cos— < l. 
l m 


When / (or m) has the value 3, m (or /) is unrestricted. The only other possibility 
is that / (or m) is 4 while m (or /) is 4 or 5. In each case, the results of Shephard 
and Todd will enable us to establish the sufficiency of 3.3 for an abstract 
definition of the group. 

In the graphical notation of §1, the abstract group 3.3 is 


3.8 L 
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for which a convenient abbreviation is [1 1 1)", with the / omitted when / = 3. 


(We place the first node at the top, the second at the bottom, and the third 
on the right.) 


The substitution R, «> R2R;R, = R;R:R; shows that / and m are inter- 
changeable: 
3.9 (lia ~[111")' 
In the case of [1 1 1"]*, we can replace (R,R2R;R2)* = Eor (R2R3RiR;)*= E 
by the equivalent relation 
(R3R,R.)? = (RiR2R;)’, 


which enables us to identify this group with Shephard’s [1 1; 1]" or Todd’s 
G(m, m, 3) (29, p. 374; 31, p. 277). Thus Shephard’s 


is our ™ or 


(1, 1; 1)" (1 11") ~ (1 11)", 


which is 3.3 with / = 3. It follows that the order of [1 1 1]" is 6m*. The first 
two values of m yield symmetric groups: 


[lliji'~>G;, generated by (2 3), (3 1), (1 2); 
[Ll ij?~G, generated by (1 4), (2 4), (3 4). 


Since the relations 3.3 with / = 1 or 2 imply m = 3, there are no other groups 
with / or m = 1 or 2. 


The substitution R; «+ R,R;R, enables us to express {1 1 1‘]* in the form 
R,? = (R2RiR3R;)* = (R;R;)* = (RiR:2)* = (R2R;)* = E, 


which is No. 24, of order 336, in the list of Shephard and Todd (31, p. 299). 
Similarly [1 1 15]* is No. 27, of order 2160. Thus the relations 3.3 suffice for 
an abstract definition in every case. 


4. Extension to higher spaces. In accordance with the graphical sym- 
bolism described in §1, we derive from [1 1 1']" a group 


lpgr'"=([gpr')” 
in unitary (p + q+ 71)-space, by adding “tails” of p-—1, g-—1, r-1 
branches to the three nodes in 3.8. 


The isomorphism 3.9 is easily seen to be maintained when we add a tail 


to either or both of the first two nodes (which are joined by the marked 
branch): 


4.1 lpg i'}"~ pq") 
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When / = 3, we have the group 


lpqr]”, 
which involves the three numbers p, g, r symmetrically. 
As this notation scarcely differs from Shephard's (29, p. 371), we can make 


use of his results in order to interpret the graphical symbol both abstractly 
and geometrically. For instance, the group 


{1 l 2"}3, 
defined by 


4.2 Ri - (R:R;)* = (RsR,)’ = (RiR»)' = (RiR2R;R,)* 
(RiR,)? = (R2R,)* = (R;R,)* = E, 


ll 
ll 


is generated by reflections in the covariant coordinate hyperplanes when the 
metric is determined by the form 3.1 with the extra terms 


x'E4 — 3(x2#4 + xtZ?), 


In Shephard’s notation (29, p. 379), this group of order 24/* is 


[1 1; 2]* 


Extending the tail, we have [1 1 (m — 2)*]’, which is his [1 1; » — 2]', of order 
"“' n!. In fact, all his graphical symbols (29, pp. 371, 374, 379, 382, 383) 
can be amended by the following simple modification. Whenever he draws a 
triangle with a number (m or 4) inside, this inner number should be changed 
to 3. 

Let 2-*-*-" f(p, g, r) denote the determinant of the form corresponding to 
[pqr']™; for instance, 


2-—c —-!1 0| 
l-@ 2-1 0| 
fl, 1,2) = att ot . 1/ = § — 4cos + — 8 cos 
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We easily find (cf. 17, p. 426) the recursion formulae 
Jd) 


2f(p — 1,9, 7) — f(p — 2, 9, 7) 
I(@ br) 
2f(p,9,r — 1) — f(p, 9g, 7 — 2), 


with the initial values 
f(1, 1,1) = 5 — 4.cos’~ — 4cos’=, 
l m 


fl, 1,0) =4sin’>, f(p,0,7) =p +r+1, 


whence 
f(¢.ar) = (@+0D(@¢+1)4+r- 4pq(cos” 7 + rcos’ co 
(cf. 29, p. 372). The necessary condition 
f(p,a,r) >0 


yields the finite groups listed in Table I]. The identification with Shephard’s 
list can be completed as follows. 
Setting / = 3, we see that [p g r]™ occurs whenever 


4.3 pb+qt+rt+1 — 4pqr cos? ~ > 0. 
In particular, we have [p g r}? for all values of p, g, 7; but this is merely an 


unusual way of generating ©S,,,4,4:, the symmetry group of the regular 
simplex ap4,4+, in real (pb + g + r)-space. For instance, 


(213) 
is S;, generated by the transpositions 


a3), (8 4), 
(34), (36), (67). 
(2 4), 
The criterion 4.3 shows that, when m > 2, the numbers ), g, 7 cannot all be 
greater than 1. Thus every such finite group (with / = 3) is expressible as 


t.4 [pg i]l"~I[pq 1"}', 


which we shall usually write with the 1 in the middle: [p 1 q]". It is evidently 


Shephard’s [p 1; g]" (29, p. 373). 


- 


be 


tly 


~~ 
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The isomorphism 4.4, which is a special case of 4.1, has an interesting counter- 


part for [1 1 r‘]*. Consider, for instance, [1 1 2']*, defined by 4.2. Instead of 


R,, we may introduce a new generator 


R, ™= R «R;R Rsk, 
by writing 


Ry = RsRyRiRaRs. 
The consequent relations 
R,* = R3 = R3 = R = (R»R;)* = (Rui) 
= (R»RsR,R,R4R;)' = (RiR2)' 
= (R,R;)? = (R2R,)* = (RsR,)* = E 








are naturally represented by the graphical symbol 
— 

‘ 

(iii 
More generally, an alternative symbol for |1 1 r‘]’ (r =  — 2) consists ol 
an m-gon with the mark / inside to indicate any one of m equivalent relations 
such as 

(R,RoR;... RR, i -— ‘= E 

When » = 3, we simply have [1 1 1]' as an alternative symbol for |1 1 1" 


5. Commutator subgroups. I{ the words in the defining relations for a 
given abstract group @ are such that each involves an even number of letters, 
then every element is either ‘“‘even”’ or ‘“‘odd”’ according to the parity of the 
number of letters occurring in any expression for it. The “‘even’’ elements form 
a subgroup @+ of index 2. 

When @ is [pqr‘]", one possible set of p + g +r — | generators for 
is provided by the products R,R, of period 3, represented in the graph by the 
unmarked branches. (We may ignore the branch marked /, even when / = 3 
Any such product is a commutator: 


R,R, = R,R,R,R, = Ry' Rj" R,R,. 
Moreover, any commutator 
Ce |) a) ee R, 
is the product of an even number of R’s. Hence 


The commutator subgroup of |p q r'\" is |p q r'|"*. 
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In particular, the commutator subgroup of [1 11']" is [1 11']"*, whose 
generators 


S; = RRs, S2 = R3Re 
satisfy the relations 
5.1 S,° = S,° = (S,S,)' = (Si"S:)" = E. 


These relations suffice for an abstract definition, since we can reconstruct 3.3 
by adjoining to 5.1 a new element T, of period 2, which transforms both the 
S’s into their inverses (and then defining R, = S;T, Re = TSs, R; = T). 
Thus 
5.2 (11 14}"*+ ~ (3,3 | 1, m) 
in the notation of (12, pp. 77-85). (In order to interchange / and m, we merely 
have to replace one of the S's by its inverse.) 

Setting / = 3, we recognize [1 1 1]"* as a group of order 3m* considered by 
Edington (21, p. 208). This is generated by the permutations 


Si = (3m 1 2)(3 45)... (3m —3 3m —2 3m — 1), 
S. = (23 4)(567)...(3m —1 3m 1) 


which yields, for [1 1 1]", 


R, = (1 2)(45)... (8m — 2 3m — 1), 
R, = (2 3)(5 6)... (8m — 1 3m), 
Rs = (3 4)(6 7)... (3m 1). 


A more interesting case is 


5.3 [1 1 14]** ~ (3,3 | 4, 4) ~ LF(2, 7) 
(12, pp. 83-84). The generators 
S, = (01 )(2 6 4) = (1. (mod 7) 
S. = (0 2 3)(1 6 5) = (4x + 2) (mod 7) 
are transformed into their inverses by 
(0 6)(1 2)(3 5)(4 ~) = (z+4) (mod 7). 


Hence in this case we have an inner automorphism, and 


5.4 (1 1 14]}'~ ©, X LF(2, 7). 


6. Exponents and invariants. By 2.9 with a,, = 1, the reflection R, (in 
covariant coordinates) is 
x’, = Ky — Wap Xe (j=1,..., n) 
or 


x. = x. — Qa x. 


v 


n 
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Since this is the same expression as for real reflections (16, pp. 766-767), the 
characteristic equation for the product 
R, R:...R, 
is again 
| £( + 1) @y2A 143A eee Apr 
6.1 | on %A+1) on ... or | =0 
es TT 
| Ani Ane An3 B(X + 1)| 
(25, p. 20). In the case of [1 1 1']", given by 3.1,wetake ai2.= — 4c, ao,= — }é, 
while the remaining a, (j # k) are all — 4, so that the equation is 
I~A+1 —cr —d | 
6.2 |} —é A+ 1 —X | = () 
| —1 —-1 A+1 


or 


Mt — (E+ ec—1)AM— (~E+E—1)A4+1=0. 


This is unchanged when c is replaced by —é — 1; therefore, by 3.4, it involves 
land m symmetrically. 
In the case of [1 1 1]", we have c = e***/", the equation reduces to 


Ae — cA? — CA +1=0 
(31, p. 295, with c instead of @-'), and the roots are 
c, +e. 


In the case of [1 1 i]*, we have (c + 1)(€ + 1) = 2, and the equation reduces 
to 

P+ 227+ cA1+1=0 
with c = $(—1 + iV/7) when / = 4, and c = wr when / = 5 (31, p. 296). 


Similarly, we can reconstruct the characteristic equation for R,R.... R,, 
when > 3: 


for {1 1 2]*, Af — 1A? — A? +2141 = 0; 
for [2 1 2]°, AS — wrt + DA? + wa? — A+ 1 = 0; 
for [2 1 3]°, AS — wr® + adrdt — A? + wd? — A+ 1 =0 
(31, p. 298). The last equation arises in the form 
A+1 —X 0 0 0 0 
-1 A+1 —wh —\ 0 0 
0 —@ A+1 —X 0 0 -0 
0 —|] —] A+1 —d 0 : 
0 0 0 —1 A+1 — | 
0 0 0 0 —] A+1| 
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which is the natural extension of 6.2 (with c = w) from [1 1 1}* to [2 1 3]*. If 
the equation for [p gq r‘]™ is f,(A) = 0, we can use the recursion formula 


f(A) = (A4+1)f,-1(A) — A f—-2(A). 


By taking , g, r in such an order that g = 1, we may name the n generators 
(as in the above example) in the order of a Hamiltonian path (14, p. 8). We 
start at the tip of the first tail, pass to the second node (where there is no 
second .ail), and proceed along the third tail. 

In the case of [1 1 (m — 2)']*, we have c = —b — 1 where b = e?**/', the 
equation is 

(A — b) (A! — 6) = 0 


(31, p. 295, with } instead of @-'), and the roots are 


1, l a ol} (1) 1 


where ¢ = e**#@—-D! 
In each case, to compute h, the period of R,R»...R,, we observe that 
the characteristic roots are powers 


3 


€ (g=21,2,...,%; % <me<... <m™m,) 


of a primitive hth root of unity, « = e**“". Our choice of the positive value 
of s in 3.5, combined with the ‘‘natural’’ order of R;, Ro,....R, in the 
chosen product, has the effect that 


6.3 m, =h— 1. 


(By reversing either of these conventions we would have had m, = 1 instead 
of 6.3. In the case of real reflections, this important distinction disappears, as 
we have both m, = 1 and m, = h — 1.) The actual values are given in 
Table II on page 270 (cf. 16, p. 771). It was proved by Shephard and Todd 
(31, p. 289) that any finite group of unitary transformations which possesses 
a basic set of m invariant polynomial forms is a group generated by unitary 
reflections. Moreover, when the m invariant forms are chosen so as to have 
the smallest possible degrees, the product of the degrees is equal to the order 
of the group. They observed (31, pp. 283, 294) that these degrees are just 
the numbers 

m,+ 1 (j= 1,2,...,) 


Therefore the Jacobian J of the m forms has degree }>m,, which is equal to 
the number of reflections in the group (31, ‘:p. 290). Since J changes sign 
when operated on by a reflection of period 2, it follows that J factorizes into 
dm, linear forms which, when equated to zero, give the }-m, reflecting 
hyperplanes (30, p. 47). 

For instance, the group [1 1 1*]* has exponents m, = 3, mz = 5, m; = 13. 
Klein (26, pp. 446-448) described invariants f, V, C, of degrees 4, 6, 14, 
and observed that their Jacobian factorizes into 21 linear forms which, when 
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equated to zero, give the axes of 21 harmonic homologies transforming the 
plane quartic curve f = 0 into itself. 

Incidentally, we have the simple expression 
6.4 (m,; + 1) (me +1)... (m, + 1) 


for the order of the group. 


7. Central quotient groups. Another interesting property of the degrees 
m, + 1 is that their greatest common divisor 


k= (m+1, ... ,m, +1) 
is equal to the order of the centre of the group. In fact, since 
m, = —1 (mod k), 
the characteristic roots of the element 
(Rs. nor 


are 


mah /k 2s im; /k —2ri/k 
€ =€¢ =¢ ’ 


i.e., they are all equal, which means that this is the central transformation 
e~**"* 1, of period k (31, p. 280). 

By regarding the coordinates x, or x’ as being homogeneous, we pass from 
complex affine n-space to complex projective (m — 1)-space. Instead of a 
group @ generated by involutory reflections, we now have the central quotient 
group 

/ C,, 
generated by harmonic homologies (31, p. 275). Its abstract definition is given 
by the graphical symbol along with the single extra relation 


7.1 (R,...R,)** = E. 


In the complex projective plane, we have two groups: 
{1 1 14]*/ ©, ~ LF(2, 7), 
defined by 3.3 with / = m = 4 and (R,R>2R;)? = E (cf. 5.3, 5.4), which is 
Klein’s collineation group of order 168, containing 21 harmonic homologies 
(26, p. 440); and 
[1 1 15]*/G— ~ Ae, 


defined by. 3.3, with 1 = 5, m = 4 and (R,R2R;)* = E, which is Valentiner's 
collineation group (35, p. 227), of order 360, containing 45 harmonic homolo- 
gies. When the latter is represented as the alternating group of degree six 
(37), the homologies appear as the double transpositions; e.g., the generators 
may be taken to be 


R, = (1 3) (46), Re = (3 4) (566), R; = (23) (5 6). 
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In complex projective 3-space, we have 
[1 1 2]*/ Ga, 


defined by 


R; = (R2R;)* = (RsR,)* = (RiR2)* = (RiR2R;R2)* 
(RiR,)? = (RoR)? = (RsRa)* = (RiR2R3R,)* = E, 


which is Bagnera’s collineation group (1, p. 33) of order 1920, containing 40 
harmonic homologies. This leaves invariant the Maschke quartic surface 
x4 + yt! + 24 + t4 — 12xy2t = 0 


(27, p. 504; 20a), and a configuration consisting of five tetrahedra (29, p. 382), 
namely the tetrahedra (ab), (ac), (ad), (ae), (af) of Hudson (24, p. 43). 
In complex projective 4-space, we have 
[2 1 2]*/ G:, 
which is Mitchell’s collineation group of order 36-6!, containing 45 harmonic 
homologies whose centres are the nodes of the Burkhardt primal (2; 33; 34). 


Since the word 
(R,R»R;R4R5)® 


involves an odd number of letters, this element, which occurs in [2 1 2]? as 
the “central inversion’, does not belong to the commutator subgroup 
[2 1 2]*+; hence the central quotient group 


[2 1 2]*/ ©. ~ [2 1 2] 
is the simple group of order 25920, and [2 1 2]® is its direct product with G». 
In complex projective 5-space, we have 
[2 1 3]*/ 
(29, p. 375), which is Mitchell’s collineation group of order 18-9!, containing 


126 harmonic homologies whose centres form the Mitchell-Hamill configuration 
(23, p. 402). Its commutator subgroup is the simple group of order 9-9!: 


[2 1 3)"/€s~ HO(A, 3°) ~ PUL (Fs) 
(23, p. 451; 19, p. 310; 20, p. 48). 
8. Infinite groups. When the form 2.1 is not definite but only semi- 
definite, we have an infinite group generated by reflections 2.9 in m hyperplanes 


x, = 0 forming an (m — 1)-dimensional simplex. In fact, since det(a,) = 0, 
there exist constants z',..., 2" such that 


> 2 an =0 
> 2’ x = > 2’ x,. 


and therefore, by 2.9, 


onic 


34). 


ning 
tion 


UNITARY REFLECTION GROUPS 


i) 
ao 
~I 


Thus we may regard the group as operating in the (m — 1)-space 


> 2’x, = 1, 


in which the semidefinite form determines a unitary metric, and the R,'s 
appear as reflections in the simplex formed by the sections of the coordinate 
hyperplanes. 

Adapting 3.7, we see that [1 1 1']” is infinite when 


2 cos == + 2 cos == = 1, 
l m 
When / = 3 and m = o, this is the group 


Ri = (R2R;)* = (RsR,)* = (RiR:)*’ = E 


generated by real reflections in the sides of an ordinary equilateral triangle 
(14, p. 78). The only other two-dimensional instance is 


(11 14]*~ [1 1 19}, 


whose commutator subgroup (3, 3 | 4, 6) is already known to be infinite 
(12, p. 95; 13, p. 250). 
The group [1 1 1°]', of order 6/*, defined by 3.3 with m = 3, has two infinite 


<> 


given by the extra relations 


extensions 


Ri = (RoR;)* = (RoR2)* = (RoRiR2R;)* = E, 
Ri (R,R,)’ (R2R,)’ (R3R,)* = E, 


co 
Il 


respectively. When / = 2, they reduce to the real groups 


5 ana [5 3] 


(14, pp. 84, 85). More generally, [1 1 r']* is a subgroup of the infinite group 


[>a 


(but on p. 265 we shall see that, as a geometrical group, this is discrete only 
if / = 2, 3, 4 or 6). 
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Similarly, the finite groups 
(1 1 14, (21 14%, [11 2}*, (21 3) 
have infinite extensions 
[1 1 2*}*, (21 24%, (1 1 3)}*, (21 4)*. 


The last three were discovered by Shephard (29, pp. 382, 383, 381). But we 
must remember to change the mark inside his triangles from 4 to 3. His scheme 
does not include our 


4 
[1 1 24}4, 
whose determinant is 
2-c -l1 0) 
,|7@ 2-1 0] 700 
| pag Ore te T = 8 
0 O-!1 2 


(since cé = (c + 1) (E+ 1) = 2). 


9. Orthogonal coordinates. By working out the cofactors in the 
determinant 3.6 for [1 1 1']™, we find the adjoint form of 3.1 to be a numerical 
multiple of 


9.1 3(x1% a XeZe) + (4 —_ CE) X33 
+ (¢ + 2) (Xo¥%3 + x3%1) + (E + 2) (xs%. + x12) 
+ (2é + 1)xi%2 + (2c + 1)xo%o. 


(See Table I, on page 269, for the values of c.) 
In the case of [1 1 1]", where / = 3 and 
c = ettm 
9.1 may be expressed as 


(xy + CX2 + Xs) (41 + EF2 + Fs) + (Ex, + x2 + X3) 
+ (x1 a Xe 4 X3) 


In terms of orthogonal coordinates 
= Xi t CXet+X3, $2 = Oxi + xXe+X3, Es = Xi + X2 + Xs, 


the form is ££: + f& + &3:&s, and the reflecting planes are x, = 0 where 


(3 


an 


ve 
ne 
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9.2 (1 — c)e, = c(& — &s), (1 — chee = Es — Bs, (1 — cles = Bs — Che, 
so that the reflections themselves are: 


Ry, (é1, fo, §3) 
Re, (é1, £o, £3) 
Rs, (1, fo, §3) 


(31, pp. 276, 295). 
In the case of [1 1 1']*, we have 


c= -b-1, b = emt! 


(£1, &3, &2); 
(és, fo, &1); 
(c&2, €&1, &s) 


tis 


and the form 9.1 is 
3(x1%1 + X2%2) + (b — 1) (6 — 1)xs8; 
= (6 — 1) (xe¥3 ~ X3%,) = (b = 1) (x3¥o + X43) 
—_ (26 — 1)xi:%-e — (2b oa 1)xe# 
= (x4 — Xe) (#1 —_ £2) + (bx — Xe) (bz; —_ £2) 
+ {bx; — x2 + (b — 1)x3} {6% — %. + (6 — 1)%s}. 
In terms of orthogonal coordinates 
& = b(x, — Xe), E> = bx, — Ze, Es = bx, —Xe+ (6 — 1)xz, 
the reflecting planes are x, = 0 where 
9.3 (1 — b)x, = bf; — &, (1 —db)x2 =f: — ke, (1 — b)xs = be — Es, 
so that the reflections themselves are: 
Ri, (&1,&,&3) —> (dé, bé:, £3); 


Ro, (&1, &2,&3) —> (&2, £1, &3); 
Rs, (&1, &2,&3) —> (&1, &s, &e). 


The resemblance of 9.2 and 9.3 illustrates the fact that [1 1 1'J" and [1 1 1"]' 
are different ways of generating the same group. 


In the case of [1 1 1*]*, we have 
9.4 c = }(—1+ iv7) = 6 + B+ Bt, B =e, 
so that ¢ and é are the roots of the equation 
9.5 x+x*+2=0, 
and the form 9.1 is 


3(x1 21 +e XoXo) os 2x 3F3 — C? (xo¥s + X3%) 
—_ E?(x3Fo + X1%3) — (c — é) (X;%o —_ Xo%1) 
= 4{x1%) + (x1 + Cxe) (41 + E%2) 
+ (€x, oa C*Xe —_ 2x3) (c*Z, + &°F. - 22;)}. 


In terms of orthogonal coordinates 
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f=, & = X1+ CXo, &s = Ox, + c*x, — 2x;, 


the reflecting planes are x, = 0, where 


9.6 M1 = &, x2 = $6(—E: + £2), x3 = $(—E1 + che — &s), 


in agreement with Shephard and Todd (31, p. 295), whose a and x, are our 


—c and £,. The reflections themselves are: 
Ri, reversing the sign of &; 
Re, interchanging & and £2; 
be = $(E; + c& — &s3), 
a 4&2 § €(E, + &s), 
le: 


4(—&, + cke + &3). 
In terms of the é’s, the quartic invariant (Klein's f, mentioned near the 
end of p. 254) is 


9.7 tit & + &5 + 3c(G ES + GE + GH). 


R 


One can soon verify that this is transformed into itself not only by R, and R, 


but also by R; (cf. 22, p. 338 with b/a = 32). 
The remaining groups may be treated similarly, but the details are omitted 


because the results have all been obtained another way by Shephard (29, 
p. 373) and Todd (31, pp. 296, 298). 


10. Polytopes and honeycombs. Let O denote the origin, i.e., the point 
of intersection of the hyperplanes of the generating reflections Ri, ..., R,. 
Clearly, O is invariant, and the images of any other point are all equidistant 
from O. In particular, let P be a point on the line of intersection of n — 1 of 
the » hyperplanes, say all except the kth. Then P is invariant under the 
subgroup generated by all the R’s except R,; but R, transforms P into another 
point Q, and the whole group transforms P and Q into the vertices of a con- 
figuration called a polytope (28, p. 83) (or, when n = 2 or 3, a polygon or a 
polyhedron). Any subset of the R’s (including R,) generates a subgroup which 
transforms P into the vertices of a sub-configuration called an element of the 
polytope (e.g., a side of the polygon, or an edge or face of the polyhedron). 

Similarly, when we have an infinite discrete group in (m — 1)-space, 
generated by reflections Ri,..., R, in m hyperplanes forming a simplex, as 
in §8, the images of the kth vertex P of the simplex are said to form a honey- 
comb (29, pp. 364, 375). The kth reflection R, transforms P into another 
vertex Q of the honeycomb, and the subgroup generated by the remaining 
n — 1 R’s transforms Q into the vertex figure at P. 

When the group is symbolized by a graph, the polytope or honeycomb 
is indicated by drawing a ring round the kth node (14, p. 87; 29, p. 375), 
and an element can be found by deleting one or more nodes (along with any 
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branches that occur there). Adapting the notation of Coxeter (9, p. 331) 
and Shephard (29, p. 378), we let 
(p,q r')™, (p qr"), (pqr,')™ 


denote the polytopes (or honeycombs) indicated by ringing the tip of the 
first, second, or third tail. Of course, 


(pp, g r')™ = (q by 1’), 


and there is complete symmetry when / = 3 (in which case the / is omitted). 
The same reasoning that led to 3.9 and 4.1 shows that the two polytopes 


(ppg 1)", (bp, 9 1")! 


have the same vertices; in fact, they have the same j-dimensional elements 
for 7 = 0, 1,..., p. Sim“ arly, (1, 1 r’)* has the same vertices and edges as 


1 eee 


whose graphical symbol consists of an n-gon (m = r + 2) with one vertex 
ringed (see p. 251). 

To compute the number of vertices of (p, g r')"™ or of (p q r,')™, as in Table 
111, we divide the order of the group [p gq r‘]™ by the order of the subgroup 
that leaves one vertex invariant. This subgroup is given by reducing p to 
p — 1, orrtor — 1, respectively. To make this rule apply to (p 1, r‘)" or to 
(p q 1,')", we have to interpret the symbols 


[p Or", [p g 08}. 


We see from the graph that [p 0 r‘]"™ (including [|p 0 r]" or [|p r 0)" as the 
special case when / = 3) is the symmetric group 


represented by a simple chain of p + r — 1 unmarked branches. For instance, 
the six-dimensional polytope (2 1, 3)* has 


108 - 9! 


6! 


= 54432 


vertices. Similarly, by removing the ringed node from 


(2 1 1:*)’, 


we see that [2 1 0*]* is the extended octahedral group [3, 4], of order 48 
(14, pp. 82, 85). 
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The only remaining case of [p g 0']" with / > 3 is [1 1 0']", which is the 
dihedral group [!], of order 2/, defined by 


Ri = R} = (R,R,)' = E. 


This is the case of the polyhedron 


10.1 4 


(3 3.3.°)", 


whose faces are equilateral triangles. The initial vertex P is taken on the line 
of intersection of the first two reflecting planes (‘‘mirrors’’). The third reflec- 
tion transforms P into another vertex Q, such that PQ is an edge (symbolized 
by the ringed node alone). The first two reflections, whose product is of period 
/, transform PQ into 2/ edges PQ, PQ’, etc. (see Fig. 1). Hence this is a poly- 
hedron of type {3, 2/}: the faces are triangles such as PQQ’, and every vertex 
belongs to 2/ of them. By 6.4, the order of the group [1 1 1']" is 


g = (m, + 1) (m2. + 1) (m; + 1), 


where m,, m2, m; are computed from the roots es” of the equation 6.2 
(see Table II). Since there is one element of the group for every half-edge of 
(1 1 1,°)", this polyhedron of type {3, 2/} has $g edges, 4g triangular faces, 
and g/2/ vertices. 


P 











Q 
Fic. 1: A face PQQ’ of (1 1 1,')" 
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he When / = 2 (and therefore, by 3.3, m = 3), we have the octahedron {3, 4}. 
In other cases we may obtain a real interpretation by regarding the real and 
imaginary parts of the £’s as Cartesian coordinates in a Euclidean space of 

: twice as many dimensions. When so interpreted, (1 1 1,')" appears as a 
regular skew polyhedron (11) in Euclidean 6-space. Shephard (28) did not 

, include it in his list of regular complex polyhedra because, as we shall see in 

§ 11, the symmetry operation interchanging two adjacent faces is not, in 
general, a unitary reflection. 

When / = 3 we have (1 1 1,;)", and 9.2 shows that the first two mirrors 
meet along the line £; = £2 = £3, so we take P to be the point (1, 1, 1). Apply- 
ing the reflections, we find Q to be (c, é, 1), and Q’ (1, é, c); altogether we obtain 

ne \ the m? points 

ad (cB, ch, cs), c = etm ki + ke + ky = 0 (mod m), 

e 

od | Topologically, (1 1 1,)" is a triangulation of the torus, namely the regular 

ly- map 

ex 13, 6}mn.0 

+ = (15, p. 421), which thus has a metrical realization as a skew polyhedron in 
Euclidean 6-space. Fig. 2 shows the case m = 4, with the values of k,kok 

as marked at each vertex (so that P, Q, Q’ are 000, 130, 031). 

). 

of 

eS 

Q’ 








000 
FIG. 2: (1 1 1,)4 = {3, 6} 4.0. 


Similarly in » dimensions, reflections in the » hyperplanes 
,2n f/m) 
fo = ts, Es = be, ---, be = bry br = Che (c = em) 


generate the group 
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(l1...1), 
which transforms the point (1, 1, ..., 1) into the m"~' points 
(f*,&*,...,e%), > k, = 0 (mod m). 


These were shown by Shephard (29, p. 378) to be the vertices of the ‘fractional 
y polytope”’ 


(1, 1 r")® =i 7%, r=n-— 2. 
We now recognize the same m"~' points as the vertices of the new polytope 
ON Bry | gt 


whose (m — 1)-dimensional elements are the various truncations of the 
regular simplex a, ;, just as they are in the limiting case of the real honey- 
comb 


(1, hiawts @ Q@,—h 
(7, p. 366; 14, p. 205, footnote). 


In the case of (1 1 1,')*, 9.3 shows that the first two mirrors meet along the 
line &; = & = 0, so we take P to be (0, 0, 1) and obtain the 3/ points whose 
coordinates are the permutations of 


(b*, 0, 0), b = el! k=0,1,...,/-—1 


(29, p. 377). Accordingly, (1 1 1,')* is the “generalized octahedron” 83. 
Similarly in m dimensions, reflections in the m hyperplanes 


&: = bis, 1 = &s, f2 = Es, ..., San = bh 
generate the group 
(1 1 r*}?, r=n-—2; 
and the nl points obtained by permuting 
.44....@ (k=0,1,...,2—1) 


are the vertices of the generalized cross polytope 


_ aia 


(i iri)’ = 


(10, p. 287). When / = 2, 3, 4 or 6, this a 8, is the vertex figure 
of the honeycomb 


>— a 





al 


yure 





ee 


cate el 
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which is derived from the origin by applying the group [1 1 r‘]* along with the 
(n + 1)th reflection 


Ute + + «5: Bouts En) — (Ea, -- +» Eaats l — §,). 
The reason for restricting / to the ‘‘crystallographic’’ values 
2, 3, 4, 6 


may already be seen when m = 1 (although the graphical symbol applies only 
when nm > 3). The one-dimensional honeycomb consists of the points (£) 
whose single coordinate ¢ is derived from 0 by the two operations of multiply- 
ing by 6 and subtracting from 1. In the Argand plane, these operations appeat 
as rotation through 27// about the origin and reflection in the line x = }. 
The group so generated is not discrete when / = 5, nor when / > 7. (The 
proof, ascribed to Wigner in (14, p. 65), is really due to Barlow (3, p. 17).) 
When / = 2, the values of & are the real integers; when / = 4 they are the 
Gaussian integers x + yi; when / = 6 they are the Eisenstein integers u + vw; 
when / = 3 they are two-thirds of the Eisenstein integers, namely those 
for which « + v = 0 or 1 (mod 3) (14, p. 64). The corresponding honeycombs 
in the Argand plane are 


| oj, {4, 4}, {3, 6}, 16, 3}. 


For greater values of m, we have points (§,...,&,) where each of the m 
coordinates is restricted to the appropriate one of the four classes. The corres- 
ponding real honeycombs are the ‘‘rectangular products’’ 


j 


fo }" = bay, [4,4}" = bongs, (3, 6}", (6, 3)" 
(7, pp. 353-354; 14, pp. 123-124). 


Returning to unitary 3-space, let us investigate the polyhedron 
4 (®) 


(1 1 1,*)*, 


for which 9.6 yields the line &; = & = 0 again. To avoid fractions, we now 
take P to be (0, 0, 2), whose images are the 6 + 12 + 24 = 42 permutations 
of 


(0,0, +2), (4c, +c, 0), (41, +1, +6), 


where c is given by 9.4. Since each of these 42 vertices belongs to 8 edges, 
there are altogether 168 edges and 112 triangular faces. This polyhedron 
(1 1 1,*)* is the vertex figure of the honeycomb 
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4 @ 


(1 1 23*)4, 

whose group [1 1 2*]* is derived from [1 1 1*]* by adjoining the fourth 
reflection 

fi = &, f = £2, & = 2 — &. 
Applying this group to the origin, we obtain the points whose coordinates 
£1, £2, 3 satisfy the congruences 
10.2 f&: =f =, (modc), & + & + &: = 0 (mod é) 
in the domain of algebraic integers generated by the roots (c and é) of the 
equation 9.5. From the nature of this prescription, we see that the vertices of 
(1 1 2*)*, like those of (2 1 4,4)* (18, p. 386), form a lattice. 

The polyhedron (1; 1 1*)* (see Table III) is less interesting than (1 1 1,*)* 
because it is not regular but only “quasi-regular” (14, p. 18). In fact, the 
graph shows that each of its 336/6 = 56 vertices is surrounded by three 
triangles and three squares, arranged alternately. 


11. The symmetry group of the polyhedron (1 | |,')". The group 3.3 
has an involutory automorphism which interchanges R,; and R». Adjoining a 
new element T which transforms the group in this manner (so that T* = E, 
R, = TR.T and TR; = R;T), we obtain the larger group 


11.1 T? = R,? = R,*? = (R2R;)* = (TR)?! = (TR;)? = (TR2R;)™ = E, 


which is G*-?"-" in the notation of (12, pp. 104-105). As a transformation of 
the contravariant coordinates, leaving 3.1 invariant, we may take T to be 


2,2) — (#, #, #). 


Although this is not a unitary reflection but an “‘anti-projectivity’’, its effect 
on the polyhedron 10.1 is like that of reflecting in the common edge PQ of 
two adjacent faces. Since the fundamental region of the symmetry group of 
the polyhedron is one-sixth of a face (shaded in Fig. 1), this symmetry group 
is precisely G3": and (1 1 1,')™ is the regular skew polyhedron 


13, 2/ } 2m 


(12, p. 127) or {3, 2/|, m} (11, p. 59). In particular, we see again that the 
polyhedron 
(1 11,4)* = {3,8}s = {3,8 |, 4} 
has 42 vertices, 168 edges, and 112 triangular faces. 
Setting / = 4 and m = }p in 3.7, we deduce that the even values of n 
and p for which the group G*"* is finite are given by 


‘th 


tes 


of 


up 


the 
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11.2 cos i + cos es < 3, 


in agreement with (12, p. 120, Fig. 3), where, however, the locus drawn is 
not 


4dr 4 2 2n 3 
cos — + cos— = 4 but cos— + cos— = 3. 
n n p 


One is tempted to conjecture that the new criterion 11.2 will remain valid 
when n and p are not both even. It is satisfied by all the known finite groups 
G*"”, including 


G?9-9 ~ LF(2,19), G**!! ~ PGL(2,23), G*7-§ ~ LF(2,29). 
rhe only unknown group that should be finite, according to 11.2, is G*7*"*, 
whose subgroup (2, 3, 7; 8), of index 2, is defined by 
S? = T? = (ST)? = (S“'TST)® = E. 

Perhaps some enterprising reader will test this with an electronic computer. 

12. Barnes’s new extreme senary form. Finally, we will show how, when 
the complex 3-space is regarded as a real 6-space, the vertices of the honey 
comb (1 1 2,*)* yield the lattice representing Barnes’s new quadratic form. 


As a basis for the lattice 10.2, where c = 4(—1 + i\/7), we may use the 
six complex vectors 


t, = (1, é, 1), t. = (2, 0, 0), 
t; = (1, 1, é), t, = (0, 2, 0), 
t; = (é, 1, 1), ts = (0, 0, 2) 


(cf. 18, p. 397), which combine to yield 


(c, c,0) = —t; — ts + te, 
ic, ~€, 0) = t; —t. + t, — ts, 


and so on. Thus the general vector of the lattice is 


z=) u,t, 


= (uw; + Qu. + us + Cus, Eu, + Uy + 25 + Us, Uy + Eg + Us + 26), 
where the u’s are real integers; and its norm is 


ZZ = (uy + Quo + us + Cus) (4) + 22 + Uz + CUs 
+ (Gu, + us + 2ug + us) (CU, + Uy + 24 + Us) 
+ (uy + Eu3s + us + 2u5) (Uy + Cg + Us + 2U¢) 
= 4(u;? + uo? + us? + uy? + Us” + Ue? 
12.1 + Uys + Usk; + Ugly + Ugly + Use + Uotty 
— 2(uits + Ustls + Ugts) 
= (Qu; + us — buy + Us)? + (Qu3 + uy — $6 + U2)” 
+ (Qus + ug — duo + U4)? + F (ue? + ua? + Ue? 
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Since this definite senary form represents 4 and has determinant 7, one 
naturally looks to see whether it is equivalent to the new form 4¢¢ discovered 
by Barnes (4). To establish this equivalence, we note that Barnes represented 
his form by a real six-dimensional lattice consisting of the points in Euclidean 
7-space whose coordinates 


Vo» Vy V2, Vis Va Ys, Ye 
are integers satisfying 


6 6 
Ls=0, Liss =0 (mod 7). 


In terms of unit vectors e, along the seven orthogonal axes, he observed that 
the minimal vectors are 
e@, + Gp — @, — Cy, 
with distinct suffixes satisfying 
a+b=ec+d (mod 7) 


(5, §7), and that this set of 42 vectors >> ye, is transformed into itself by 
the permutations 


Ys Vjeu4r and Yy— Vay, 


where the suffixes are reduced modulo 7. The latter permutation (of period 
6), applied repeatedly to the vector 


ti = —@o + C1 + C5 — Ce, 
yields 

t. = —@ + e@3 + Ci — &, 

ts = —@o + C2 + C3 — Cy, 

t, = —Cy + Co + C2 — Cj, 

tt, = —Co + Cy + Ce — Cz, 

ts = —@y + Cs + Cy — Co. 


The six t’s, being independent, constitute a basis for the lattice, and enable 
us to construct the form 


(Xt) 


6 
{— > U,@o + (uy + U2 — U4)ei + (Us + Ug — U6) es 
I 
a (tu + Uz — Us)@3 + (Us + Ug — U2)e,4 


+ (ue + uy — us)@s + (uy + us — 2)@6}° 
6 2 6 
(> «,) + De (us + mses — yp)” 


6 6 3 
4D Usp + 4D Hy y41 — 2D, te Unss, 
1 1 


which is 12.1. 





eee eae 


—~ es 


uw 


cc 





——— eee me 
—_ -—- 


a 


— 
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We observe that this form (equivalent to Barnes’s has the elegant matrix 


42 0-1 0 2 
242 0-1 O 
0 2 4 2 0-1 
-l1 0 2 4 2 90 
0 -1 » 2 4 2 
2 0-1 0 2 4 


Barnes (4, p. 240) has shown that its group of automorphs is 


@, X PGL(2, 7). 


In view of §11, we now recognize this as the symmetry group G*:** of the 
complex polyhedron (1 1 1,*)*. 


TABLE I 


THE COMPUTATION OF c FOR [1 1 14J" (see 3.5) 








| 

l m 2 cos = 2 cos = 2cos— c 

l m $ 

} 

3 m 1 2 cos x/m 2 cos 2x/m eari/m 
l 3 2 cos x/l 1 | —2cos x/l —1 — @™/! 
4 4 : ae v2 | -1/v2 | (-1+iv7)/2 
4 5 Jf2 T | —7*//2 -l-f% 
5 4 T V2 | —1 Tw 
4 6 V2 V3 0 i /2 
6 4 v3 v2 | —2/¥3 —1+iv2 
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TABLE III 


Some ComMpLex PoLytTopres 














Symbol Number | Number | Symbol of Shephard (29) 

defined of of or of Coxeter (11; 12) Reference 

in §10 dimensions | vertices 
(1 1 r,")8 | n=r+2 | In | (1 1; 7,)' = 8, (29, p. 377) 
(llr)? |n=r+2 ale (1,1; 7)! = fy’, (29, p. 378) 
(1,1...) | n pt p. 251 
(11 1,*)4 3 42 | {3, 8), 4} = {3,8}. p. 265 
(1, 1 14)4 3 56 CCS p. 266 
(1 1 1,;5)* 3 216 {3, 10], 4} = {3, 10}, (11, p. 61) 
(1 1 1,*)§ 3 270 | {3, 8], 5} = {3, 8}io (11, p. 61) 
(1, 1 15)* 3 360 . 
(1, 1 1*)5 3 360 — 
(22 1 1*)8 4 80 (22 1; 1)* = (4y4,)*? (29, p. 382) 
(22 1 1) 4 80 - — 
(211,‘)? | 4 160 |} (21; 1,)* p. 261 
(21 1,)* 4 320 — - 
(2 1 22) 5 80 (2 1; 22)? = (4 y%)* (29, p. 381) 
(2 1, 2) 5 432 (2 1,1; 2) (29, p. 381) 
(2 1 3;)8 6 756 (2 1; 35)? = (4 74) (18, p. 391) 
(2. 1 3) 6 4032 (2. 1; 3)? = (4 y%)* (18, p. 391) 
(2 1, 3) 6 54432 (2 1,1; 3) 
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ABSTRACT DEFINITIONS FOR REFLECTION GROUPS 
G. C. SHEPHARD 


Introduction. This paper is an appendix to “Finite unitary reflection 
groups” by J. A. Todd and the author (3) in which the irreducible finite 
groups generated by reflections in unitary space (2, p. 364) are enumerated 
and abstract definitions are given for all those n-dimensional groups which 
may be generated by m reflections. (All real reflection groups are of this kind.) 
The purpose of this paper is to supply abstract definitions for all the remaining 
groups, namely those that require more than m reflections to generate them. 

Some of the original definitions in our paper have been elegantly modified 
by Coxeter (1). The definitions given here are constructed in such a way as to 
form natural extensions of his scheme. 


1. The groups G(m, r, 3). Let 6 be a primitive mth root of unity, r a 
divisor of m, and s = m/r. Then by permuting the rows (or columns) of the 
diagonal n X n matrices 


diag {6”’, 0”, ... , 0"}, > op; = 0 (mod r), 
in every possible way, we obtain a group of sm"~'n! matrices which is denoted 
by G(m,r,n) (3, p. 277). When r = 1 or r = m, this group is generated by n 
reflections: G(m, 1, n) is the symmetry group of the polytope 7,” (2, p. 374) 
and so, following the notation of (3), we denote it by [y,"]. Also, when > 2, 
G(m, m,n) = {1y,™] is [1 1 (2 — 2)"]* in Coxeter’s notation (1, pp. 248-249). 

Consider first the groups G(m, r, 3) with r # 1, m. The matrices 


001 0 60 100 100 
1.1 P=(|010),Q={¢'00],R=({001], T={010 
100 0 O11 010 006 


clearly generate G(m, r, 3), and if we denote by P, Q, R, T respectively the 
corresponding group elements, then the foilowing relations are satisfied: 


1.2 P? = Q? = R® = (OR)* = (RP)* = (PQ)* = E, 


§ (PTP)(RT“'R) = (RT-'R)(PTP) = (QPRP)’, 


- (TQ = QT, T(PRP) = (PRP)T, T* = E. 


We shall show that these relations form an abstract definition for the group. 
Since (PTP)* = PT*P = E, (RT-'R)* = RT-R = E, and PTP S RT“ R, 
we have 
(QPRP)" = ((QPRP)’)* = (PTP)*(RT'R)* = E, 
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that is, 
1.4 (OPRP)" = E. 


It is known that relations 1.2 and 1.4 constitute an abstract definition for the 
group G(m,m,3) = [1 11]" of order 6m? (1, p. 248). Denote an arbitrary 
element of this group by G; each occurrence of G in an expression stands for a 
(possibly) different element of the group. 

By 1.3, T, and therefore 7“, commutes with PRP and so PT*PR = RPT“P. 
This implies 

(QPRP)™ = (PT“P)(RT-“R) = RPT*PT—R, 

from which 7*P = GT™. Similarly making use of the fact that T commutes 
with RPR = PRP, we get T“R = GT". 

Let W be any word, i.e. finite product of the elements P, Q, R, T. Then W 
must be of the form 

GT“ GT"G ...GT’’G. 

Let w(W) be the number of factors P, Q, R occurring in all the elements G 
of W except the first. Since 


TP = GT™, TQ = QT, TR = GT", 
whatever letter P, Q, or R follows 7“; in W it is possible to substitute some 
product of group elements so as to form an equivalent word W’ with 
w(W) — w(W’) > 1. Repeating this process and amalgamating powers of T 
when they become adjacent, W may be reduced, by a finite number of sub- 
stitutions, to the standard form 
W* = GT", O<cucs-—l, 


with w(W*) = 0. Hence the given relations define a group of order (6m?)s 
which is G(m, r, 3). 

For G(4, 2, 3) a simpler definition than that obtained by substituting m = 4 
and r = 2 in 1.3 is given by 1.2 together with the relations 


1.5 (QPRP)* = E, 
1.6 T? = (TQ)? = (TPRP)? = (TR)* = P(TROR)P(TROQR)“ = E. 
This can be checked by the Todd-Coxeter method (la, Chap. 2). 


2. The other groups. Now consider G(m, r, n) (rim; r ~ 1, m; n > 3) 


of order rm™—'n!. If we introduce m — 3 further generators S;, S.,...,: Bit 
which satisfy the relations 
S? = (PS,)? = (RORS,)? = (S,S;)? = (RS))* = E 
2.1 (1,7 = 1,2,...," — 3; |i — j| > 2), 
(S,S,1)* = (RS,)? = E (¢ = 2,3,..., n — 3), 


then these, together with 1.2, 1.4, form a definition for G(m, m, n) of order 
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m"~'n!. This is the same definition as that of (2, pp. 374-5) with the notation 
changed thus: 


P = P,, Q = R,Q,R,, R= R,, S, = Rus (¢ = 1,2,...,”2 — 8). 


The use of these new generators is due to Coxeter (1, p. 248) who has shown 
that they simplify the definitions of some of the reflection groups discussed 
in (3). 

Adding now the generator 7 satisfying 13 and 7S, = S,T (i = 1, 
2,...,m — 3) we get an abstract definition for G(m,r,n). The proof is very 
similar to that of §1, remarking that every word can be reduced to the stand- 
ard form G’T“, where G’ is an arbitrary element of G(m, m, n). 

This completes the discussion of G(m,r,m) except for the groups where 
n = 2. Here we take as generators P, Q and T corresponding to the matrices 


r=(72), o-(%.%. 1-(9) 


and obtain the relations 
P=Q=T*=E£E, T(PQ) = (PQ)T, PT“PT = TPT 'P = (PQ)’, 


which can be shown to be an abstract definition in the same way as for the 
the above groups. 

The only remaining groups generated by more than m reflections are seven 
two-dimensional groups for which abstract definitions have already been 
given (3, pp. 281-282; 1, Table XIII) and the group [(4y;‘)*"] (no. 31 in the 
table of (3, p. 301)). 

This group is generated by the matrices (cf. 1.1): 


1000 i 6. @.-¢ 1000 
0001 a. 8 ¢ @ 0100 
aii oo10} 2=lo -i 0 O R= 0001 
0100 eo @ 6 1 0010 
1-1 —1 —!1 1 0 0 0 
—1 1-1 —1 0 1 0 O 
— 2 = 
ie toe 1 —1 T Se 2. © 
—-1-1 -1 1 0 0 0-1 


The corresponding group elements satisfy the relations 1.2, 1.5, 1.6 and 
2.2 S? = (SP)? = (SR)? = (SQ)* = E, 
2.3 (ST)? = E. . 


By §1, the relations 1.2, 1.5, 1.6 form an abstract definition for G(4, ?, 3), 
while 1.2, 1.5, 2.2 form an abstract definition for [1 1 2]* = [(4ys*)*'] (2, 
p. 374). We assert that 1.2, 1.5, 1.6, 2.2, 2.3 together form an abstract de- 
finition for the group [(}y73*)*']. This can readily be verified by the Todd- 
Coxeter method by enumerating the six cosets of the subgroup [1 1 2]*. 
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DIOPHANTINE APPROXIMATION AND 
HOROCYCLIC GROUPS 


R. A. RANKIN 


1. Introduction. Let w be an irrational number. It is well known that 
there exists a positive real number / such that the inequality 


a 1 
(1) a <i 


has infinitely many solutions in coprime integers a and c. A theorem of Hurwitz 
asserts that the set of all such numbers h is a closed set with supremum 5. 
Various proofs of these results are known, among them one by Ford (1), in 
which he makes use of properties of the modular group. This approach suggests 
the following generalization. 

Let I be a (real) zonal horocyclic group. That is, [ is a Fuchsian group of 
bilinear transformations of the first kind (4) having the real axis as principal 
circle and © as a parabolic fixed point. A bilinear transformation T is defined 
by 


where a, b, c and d are real numbers such that ad — bc = 1, and w and z 
are complex numbers. It is convenient to use T to denote the matrix 


c ad 
as well as for the transformation, and to assume that [I is such that if 7 
belongs to I, then so does 


Since — 7 gives rise to the same value of w as T, we do not distinguish between 
T and —T. 

Since I has a parabolic fixed point at infinity and is discrete, there exists a 
least positive \ such that U*® € IT, where 


_ (1 1 pa (? ) 
v-(5 ): 7 


The number A has been called by Petersson the width of the cusp at ». The 
transformation U* generates a cyclic subgroup of [ which we call Ty. In 
particular, if T € [then TU™ € T for any integer m, and TU™ has the same 
Received August 15, 1956. 
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first and third entries, namely a and c, as 7. Thus a and ¢ are functions of, 
and are determined by, the left cosets of [', in I. 

In terms of a zonal horocyclic group I we can generalize Diophantine 
approximation to irrational numbers by asking if there exists a positive 
h = h(T) such that (1) is soluble for transformations T in infinitely many 
left cosets of Ty in I’, where w is any real number not belonging to a certain 
countable set, to be specified. We can, for this purpose, rewrite (1) as 


(2) lw — To| < Se, 

he 
and it is possible to express the right-hand side in terms of T also, although 
it is not particularly advantageous to do this. The exceptional countable set 
will always include the set of all points congruent to © with respect to I. 

If such a number h exists, we can then ask what is the supremum of all such 
h, and whether this supremum is attained. If (2) holds for infinitely many 
left cosets of Ty in I, the numbers JT satisfying (2) will approximate to w 
as closely as we please since it is known that |c| is less than any given positive 
number for only a finite number of cosets of I'y (3, Satz 2). 

In the particular case where [ is the modular group (1), this reduces to 
the approximation by rationals to an arbitrary irrational number w, as de- 
scribed at the beginning of the paper. Ford’s demonstration of Hurwitz’s 
theorem does not, however, generalize immediately to an arbitrary zonal 
horocyclic group, since the possible existence of parabolic cycles not congruent 
to ~ causes difficulties. Also, as might be expected, horocyclic groups of the 
second kind, which have an infinity of generators, are more difficult to treat 
since the set of points congruent to © covers the real axis more thinly. In 
this paper we consider horocyclic groups of the first kind only (see §2 for 
definition). 

Applications of the results obtained to the modular group and some of its 
subgroups are given in §7 and compared with results obtained by others. 
Considering the generality of the methods used in §§2-6, it is somewhat 
surprising that best possible results are nevertheless obtained in some cases. 
The horocyclic group approach also sheds some light on the connexion 
between irrationals which cannot be closely approximated to and hyperbolic 
transformations. 


2. The existence of numbers 4. We write z = x + iy, where x and y 
are real and use § to denote the finite upper half-plane y > 0, and & for the 
real axis y = 0, excluding the point at infinity. We suppose that I is a zonal 
horocyclic group and that D is a fundamental region for T constructed by 
Ford’s isometric circle method (2, §35) and occupying the strip — < x < — + A, 
where X is the width of the cusp at infinity and € is chosen suitably later. 

We suppose that IT is a horocyclic group of the first kind, i.e. that D has 
finite hyperbolic area. This is the case if and only if D has a finite number 
of sides. 


anc 





P 
‘ 
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Adjacent sides of D meet at vertices which are of two kinds. The point at 
infinity and vertices lying on & are called cusps. They can be divided into 
groups of congruent cusps with respect to I’, called parabolic cycles. One such 
cycle, which may be the only one, consists of the point at infinity alone. 

The remaining vertices all lie in §. They include points at which different 
isometric circles meet, and possibly elliptic fixed points of period 2 belonging 
to T which lie at a midpoint of a single bounding arc. Such vertices we call 
ordinary vertices of the first kind. 

It may happen that some sides of D consist of complete semicircles: if this 
happens it is convenient to regard such a semicircle as two equal sides meeting 
at a vertex which is the point farthest from %; such vertices we call ordinary 
vertices of the second kind. 

We now choose £ so that the line x = & passes through an ordinary vertex 
of the first or second kind. Then every vertex of D in © is of the first or second 
kind, and D is bounded by a finite number of sides which can be grouped in 
pairs of equal arcs transformable into each other by transformations of I. 
One side of such a pair (if not part of a straight line) is an arc of the isometric 
circle §(7) of a transformation 7, and the other is an equal arc of $(7~"'), 
where T € I. 

Define hy to be twice the minimum distance of the vertices of D in § from 
%. We are now in a position to state 


THEOREM 1. Let I be a zonal horocyclic group of the first kind, and let w 
be any real number which is not a parabolic fixed point for T. Then there are 
infinitely many left cosets of Ty in T whose members T satisfy 


l 
— 0). 
< hee (c # 0) 


(3) o—*] = lw — Teo! 
c | 


3. Proof of Theorem 1. Let w be any real number which is not a parabolic 
fixed point for T and let = &(w) be the straight line through w which is 
perpendicular to Wf. 

For any T € Tf with T ¢ Ty and for any h > 0, let S(T, h) be the circle 

| 4% 
(4) los —-a—-~| =-, 
he he 
which is of radius 1/(he*) and touches & at a/c, but otherwise lies in §. 
If 2 cuts S(T, h), we have 


a} ] 
lw—--| <=, 
c he 


and conversely; i.e. (2) holds. 
Let 2’(h) be the line y = 4h; then 


T @(h) = S(T, h) 
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if 7 €¢T, 7 ¢Ty. For T € Ty we define S(T, h) to be & (hk) and it then 
follows that 


SS(T,h) = S(ST,h) = ST @(h) = ST S(E, h) 


for all S, T € TI, where E is the identical transformation. 

Accordingly % cuts S(T, h) (T ¢ Ty), and so (2) holds, if and only if T—' & 
cuts %’(h). But 7-' & is a circle centred at a point of & and so cuts &’ (A) 
if and only if its radius exceeds $h. Also, if T—' 2 cuts &’ (hk) so does (T’)—' & 
if T’ and T are in the same left coset of Ty in T; for T-' 2 and (T’)~ & are 
equal circles r\ apart where r is an integer. Thus the theorem will follow if 
we show that &’ (/,) is cut by infinitely many circles T7-' 2 (T € T) at points 
lying in the strip § <x < — +. 

Since w is not a parabolic fixed point, the line & passes through the interiors 
of infinitely many fundamental regions TD (T € 1). Let the fundamental 
regions through whose closures 2 passes be 7,D (n = 1,2,3,...) taken in 
order as a point z moves along 2 from @ through § to Y; if 2 passes through 
any points congruent to vertices of D round which more than two fundamental 
regions cluster, then some definite ordering of the associated transformations 
T, must be determined. 

We suppose that Theorem 1 is false. Then there exists an integer N such that 
, = T,—' 2 does not cut & (hy) for n > N. Let k be the maximum number of 
images of D which are grouped round an ordinary vertex of D. We show first 
that %, does not pass through an ordinary vertex of D ifm > p = N+ k. 

For suppose that 2%, passes through the ordinary vertex A of D for some 
n > p. Then A is at a distance $hy from YW and is that point in § of &, which is 
farthest from &, i.e. the summit of &,. It follows that A is not an ordinary 
vertex of the second kind since, if it were, then 2, would cut & in two parabolic 
fixed points which is impossible since w is not a parabolic fixed point. 

Thus A must be an ordinary vertex of the first kind, and so lies on an 
isometric circle $(7T) of a transformation T of I, of which a finite arc y 
forms one of the sides of D. If T is chosen in its right coset of 'y in T so that 
TA is also a vertex of D, then T¥, = TT,~' 2 meets the closure of D in TA, 
and so TT,-' = T,,~! for some m # n. Moreover m > n — k > N. Hence 
g,. = T %, does not cut &’ (Ay) but meets the closure of D in TA which must 
therefore be the summit of &,,. We thus have two equal circles %,, and T &, 
with summits A and TA. Since the transformation T is equivalent to an 
inversion in $(7) followed by a reflexion in the radical axis of $(T) and 
3(T-), this is possible only if A lies on $(T) and if 2%, = 4(T). Hence y 
is an arc of &, with one endpoint at its summit A. The other endpoint is either 
an ordinary vertex of the first kind which is nearer to & than A, or is a parabolic 
fixed point which is congruent to w since it is not congruent to ©. In either 
case we have a contradiction. 

Hence %, does not pass through an ordinary vertex of D if m > p. Thus, 
for each n > p, &, passes through the interior of D cutting its sides at interior 


1en 


10r 


oe 


HOROCYCLIC GROUPS 281 


points. This is a contradiction if there are no parabolic cycles other than that 
containing the point at infinity, and so the theorem is true in this case. 
Suppose therefore that other parabolic cycles exist. Then, for each m > p, 
¥,, contains in its interior a cusp A, of D, and consists in part of an arc lying 
in the interior of D whose endpoints B,, B,’ lie on sides /,, /,’ of D which 
meet at A,. Here 


qT, B, = Ta+1 Basi, 
and the points B,, B,’ are interior points of the sides /,, /,’, respectively. It 
follows that 
Tutt Tn 
transforms /,’ into /,,, and therefore maps A, into A,,;. Hence all the points 


A, for n > p are congruent and so belong to the same parabolic cycle. Accord- 
ingly there exists a positive integer g such that 


Apirg = Ay (y = 0, 1,2,...) 
and 

ep EF (r = 0,1,2,...) 
where 


P = T;, 71>. 
Here P is a parabolic transformation having A, as fixed point. 
Choose a bilinear transformation S,; having W& as fixed circle and such that 
S,A, = ©. Then P = S,' U* S, for some up # 0, and so 


(5) Sik. = SyP’ T>*2 = U" ST,’ 2 = U" SX. 


Since ¥,4,, cuts /, and /,’, S:2,4,, cuts S,/, and S,/,’; these are lines perpendicular 
to & since they pass through S,A, = ©, and this holds for r = 0,1,2,.... 
But, by (5), the set of circles S,2,,,, consists of the circle S,%, and its translates 
through multiples of » parallel to A, and so S,%,4,, cannot cut S,/, and S;,/,’ 
for every r > 0. 

This contradiction shows that no integer N exists, and this completes the 
proof of Theorem 1. 


4. Upper and lower bounds for sup h. Let €(£, [) denote the set ol 
all positive numbers h for which (2) holds for transformations T belonging 
to infinitely many left cosets of Ty in I’, where I is, as previously, a zonal 
horocyclic group of the first kind. The reason for displaying E will appear 
later. By Theorem 1, hy € G(E,T) and so h € G(E,T) for all positive 
h < hy. In this section‘we obtain an upper bound for €(E£, T). We prove 
first 


THEOREM 2. The set €(E, T) is bounded above. 


Proof. The group [ contains hyperbolic transformations since if 7 © [ 











282 R. A. RANKIN 


and c # 0, then U"T € [ for all integers r and is hyperbolic if r is sufficiently 
large. Let 


AB 
(6) as - ®) 


be a hyperbolic transformation belonging to [ with C > 0 and A + D > 2: 


this latter condition can always be satisfied by taking H—' if it is not true for H. 
Write 


A+ D = 2secé@ (0 <0 < $n). 
Then the fixed points of H are 
z, = {tan(4" — 30) — D}/C, 22 = {tan(4e + 40) — D}/C. 
They are the points F), F: of Fig. 1. 




















4 
| 
A fundamental region Dy in © for the subgroup ['y of I generated by H 
is the region exterior to the circles 
$1 = (A): |Cze+ D| = 1, 
and 
$2 = S$(A-'): |Cz —A| = 1. 
The circles H"3, (nm = 0, +1, +2,...) bound the images of Dy by Try, 4 


and consist of a set of non-intersecting coaxial circles with limit points F; 
and F;. For n > 1, these circles are interior to H3, = 2, and all contain F;; 
for n < 0, these circles are interior to %, and all contain F;. 


Vv 
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Let € be the semicircle in § on FF: as diameter. € has radius (tan @)/C 
and cuts $3; and 9:2 orthogonally at the points P; and P2, say; @ is the angle 
P, O; F;, where O, is the centre of &;. 

Suppose that m is the least positive value of |c| for all transformations 
T € T; by considering T7-' U* T we see that m > 1/\. Take any number h/ 


satisfying 
2tané C 
h> max(2t2n 6 oe cot ys) 


C 
(7) 422 -¢_, ....£ 
' iC] ae m(|A + DI — 2)8 
= PH, 


say. Also take w = 2;. We show that \(w) cuts only a finite number of the 
circles or straight lines S(T, Ah) for T € I. 
If £ cuts S(T, h) at a point of § lying in H—-" Dy, then H" ¥ will cut 
H" S(T, h) = S(H" T, h) 


in a point of § lying in Dy. But for n < 0, H" & lies entirely within 9), and 
so does not meet Dy; thus we need only consider n > 0. Let ©, be the closure 
of the part of H" 2 which lies in Dy. Then it suffices to show that the arcs 
C, (n = 0,1,2,...) have only a finite number of intersections with the 
curves S (T,h) for T ¢ 

For » > 1, H" % is a circle orthogonal to Y% passing through the fixed point 
w and through ¢, = H" o. The point ¢, lies within H” &,, and so within Qo, 
and 22 < ¢, < ¢; = A/C. Hence the arcs ©, for n > 0 all lie in the closed 
region j§ of § lying between the abscissae x = — D/C, x = A/C and outside 
31, J: and €. Let Qi, Q2 be the summits of 3, and &, respectively so that the 
boundary of § consists of five sides (taken closed) joining the vertices ©, 
Qi, Pi, Po, Q2 and ~. We write S; and GS» for the sides joining © to Q; and 
Q» respectively. 

We divide the proof into five cases. 

(i) The line &’(h) = S(E,h) cuts only a finite number of the arcs &,, 
since ©, tends to € as nm — © and $h exceeds (tan @)/C which is the radius 
of &. 

We now show that no circle S(7,h), for T r, 7 ¢ Ty, has any points 
in common with §. 

Suppose (ii) that S (7, h) meets S, but contains no point of the arc Q\?P; 
Then its radius exceeds 1/C, so that 


C > hc? > hm? > C cot 48, 


which is a contradiction since 6 < 4x. Similarly S (T,h) cannot meet S» 
and contain no point of Q2P». 


Write 7, = HT, where 


(8) 7, = («: os) , 
Ci d, 
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and suppose next (iii) that c, = Ca + Dc = 0, so that S (7,h) touches & 
at O,. Then 7; € I'y, so that we may take c = C,a = — D. Then if S (T, h) 
meets jf}, 2/(hc?) > 1/C so that, by (7), 


tan@<1, C* < 2m’ tan 36. 
This gives a contradiction since 2 tan 46 < 2(./2 — 1) < 1 and C? > m’. 
In the same way we can show that & (7, #) cannot touch & at A/C. 
We now suppose (iv) that c, ~ 0 and that © (7, hk) contains in its interior 
or on its perimeter a point 
zs = (e* — D)/C 
of Q,P:, where 6 < @ < 4x. Then, by (4), 


joa — Z| = 1 | 00s ¢ - «1) +i(csin ¢ - £)| <P. 


Cc he 


Hence 
26 sin @ > (ccos ¢ — 4)’ + c’sin’¢ 
= (¢c + c)’sin’}¢ + (c — c)*cos }$¢. 


Since 7, € [ and c, #0, either |c + c¢,;| > 2m or |c —c;| > 2m. Also 


sin?*4¢ < cos*4¢, so that 


26 sin @ > 4m’sin*}¢. 
Thus 
h< oc cot 4¢< & cot 46, 
m m 


which contradicts (7). We can also show that S (7, 4) cannot contain a point 
of Q2P2; for, if it did, S (H-' T, h) would contain a point of Q,P:. 

Finally (v) suppose that S (7, 4) contains a point of P,P, but not any point 
of Q,P; or Q2P2. Then the radius of S (7, 4) is greater than that of the circle 
which touches & and touches € internally at P;. This circle has radius 
(tan 40)/C, so that 

1/ (hc?) > tan $0/C. 
Thus 
h < (C/c*) cot 48 < (C/m?*) cot 46, 


which contradicts (7). 

We have therefore shown that no circle S (7,h) with T € T, T ¢Tvp, 
has any point in common with §, so that &% cuts at most a finite number of the 
circles S (T, h) and therefore h ¢ €(E£, [T). This proves Theorem 2. 

If we define 

h(E, T) = suph (h € &(E,1T)), 


ior 


lso 


int 


int 
cle 
jus 
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and write 
(9) hy’ = inf py (H hyperbolic in TL), 


we have proved 


THEOREM 3. fy < A(E,T) < hy’. 


5. Approximation to w by ST for fixed S. Let S€ Qg, where Qe 
is the continuous group of all bilinear transformations with & as principal 
circle, and 


0) =i 8) 
(10 S t 3)" 


We consider whether it is possible to approximate to a real number w by 
ST @ where T belongs to a zonal horocyclic group I of the first kind. This is 
not quite the same as approximating to S~'w by JT, since instead of using 
1/c? as a measure of approximation we shall use 1/c,? where 7, = ST and is 
given by (8). 

Let A(Ty, T) be a set of representatives of the left cosets of Ty in I 
arranged in such an order that |c| ~ » as T runs through #(Ty, 1). We 
prove the 


Lemma. Let &’ be any subset of A(T y, T) consisting of transformations T 
¥y £9 J 


for which T,e = ST @ is bounded. Then \c,| < M for only a finite number of 


T € #' where M is any given positive number. 
Proof. Suppose that |T,;o| < K for all T € 4’. Then 


lex] = |va + be] = |c||yT + 3| = |e|/|yTi” — a 
> Iel/ {IK + lal}. 


Since |c| < M{|y|K + Jal} for only a finite number of T ¢ A(Ty, LT), the 
result follows. 

Now suppose that w is any finite real number such that S~'w is not a parabolic 
point of fT. Then, for any h € €(E£, 1), 


(11) lsty — To | <Q (c ¥ 0), 
for infinitely many T € #A(Ty, 1). Let # be an infinite subset of (Ty, T) 
for which (11) holds. Since |c| ~ © as T runs through #(Ty, T), we may 
suppose that #’ is taken so that the points T, for T € F’, lie in as small an 
interval as we please round S~'w, and hence that the points ST @ lie in a 
bounded region containing w. It follows from the lemma that |c,;| < M for a 
finite number of T € #’ only, and we may therefore arrange the transforma- 
tions T in #’ in such an order that both |c| and |c;| tend to infinity as 7 runs 


through #’. 
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For any T € #&’ we have, from (11), 
lw — STo| = |(Sw — To){(yS"' w + 8)(yTo + 8)}"| 
1| yTo +6 


<p lyst +8) OTe +H 
we nie | PEO HS 
= hey ly Sw + 6 
As T runs through 7’, 
ee Ss) 
lyS ty + 6 ' 


and so, if h’ is any positive number less than h, we have 
] 
h'cy" 


for infinitely many T € A(Ty, T). We have therefore proved 


lw — STo| < 


THEOREM 4. [fh < h(E, 1), S € Qe, and w is any finite real number such 
that S~'w is not a parabolic fixed point of T, then 


= - l 
lw — ST] = lw — T,0| < 3 
he, 
for all T belonging to an infinite subset 2’ of A(T y, T) which may be arranged 


so that \c;| + © as T runs through 2’. 


Let © (S, IT’) be the set of all positive # for which the conclusions of Theorem 
4 hold with the conditions there stated on S, w and #’, and let h(S, [) be its 
supremum. Then, by Theorem 4, 


h(S, T) > h(£, YT). 
But we have, conversely, 
1 |yS"w+é 
he’ | yTo +6 
for every h in & (S, T). Since ST # — w as T runs through #4’, To + Sw 
as T runs through # and so 


|\S*w— To 


h(E, T) > h(S, T). 
We have therefore proved 
THEOREM 5. Jf S € Qe, then h(E, T) = h(S, YT). 


Hence all the sets € (S, [) are identical, with the possible exception of 
one point. 
We can also prove 


THEOREM 6. hy € € (S, T) if S € Qe. 


uch 
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To prove this we carry through the argument of §3 with certain modifica- 
tions. We wish to show that S-' % cuts S(7, h) for T in infinitely many left 
cosets of ['y in [, when S~'w is not a parabolic fixed point. The demonstration 
proceeds as in §4 with & replaced by S~' & wherever it occurs after (4). Where 
before we used the fact that w was not a parabolic fixed point we now require 
that S~'w is not a parabolic fixed point. As before, S~' 2 will pass through the 
closures of infinitely many fundamental regions 7, D (m = 1, 2,3,...). 
We now choose 7,(D) to be the fundamental region containing the summit of 
S-' ¥ and proceed along S~' % towards S~'w. The case where the other end- 
point of y lies on & can now be excluded since, because of this choice this 
endpoint would have to be S~'w which is impossible as S~'w is not a cusp. 


6. Another upper bound for 4. Suppose that I* is a zonal horocyclic 
group of the first kind and that IT is a subgroup of finite index in I'*, and so 
also a zonal horocyclic group of the first kind. Let « be the width of the cusp at 
infinity for ['*, so that A = ky, where k is a positive integer, and let T'y* be 
the subgroup of I'* generated by LU. 

The parabolic fixed points of ['* are parabolic fixed points of I, and 
conversely. We suppose that w is not a parabolic fixed point for [. Then, by 
Theorem 5, 


(12) h(E, T) = k(S, T) for all S € I*. 

Take a fixed S € I™* and a fixed hyperbolic H in [* with entries given by 
(6), such that 
(13) S¢qreror, 


where [',* is the subgroup of I'* generated by H. Further, let mys be the 
least value of |c| for all T € T'y* ST, so that mys > 0, by (13). Also take any 
h with 

9}! 
(14) h> 5 cot 46 = £|\4 +14 2] = pys; 
where the notation is that used in $4. 

Take w = 2;. We apply the method of §4 to show that ¥ does not intersect 
any S (ST’, hk) for T’ € I. This is the same as showing that no circle' S (T, h) 
intersects any arc ©, (m > 0) where T = H"ST’. 

Case (i) of §4 does not arise, and the proof of case (ii) is similar, with m 
replaced by mys. Case (iii) also does not arise, since if c,; = 0, where 7, = HT, 
we should have 7, = H"*'ST’ € [y*, which contradicts (13). Similarly if 
Ca — Ac = 0. The proof of cases (iv) and (v) is similar with m replaced by 
mys. Thus we have shown that if h satisfies (14) and S satisfies (13) then 
h¢€(S, PT). 

Define 


hy = inf pis 


| S(T, h) cannot be a straight line by (13). 











288 R. A. RANKIN 


for all hyperbolic transformations H € I* and S € I™* for which (13) holds; 
if no such H and S exist define hy’"’ = @. It follows from (12) and what we 
have just proved that the following result holds. 


THEOREM 7. h(S*, T) < hy” for all S* € I*. 


7. Applications. 


7.1. If we take I to be the full modular group ['(1) and w irrational, we 
obtain hy = +/3. By taking 
C? 
H = 11 


in §4 we see that if h > +/5, then h ¢ € (Z£, ['(1)). Thus 
V/3 < A(E, T(1)) < V5. 


In fact h(E, T) = V5 as can be obtained by a more detailed study of the 
modular configuration, as shown by Ford (1). 


7.2. If we take T = ['(2), the principal congruence group of level 2 con- 
sisting of matrices JT = E (mod 2), we obtain by Ford’s method the funda- 
mental region D consisting of points in the strip —} <x < 3/2, y>0 
which lie outside the circles |2z2 + 1| = 1, |22 — 3| = 1. Thus Ap = 1. 


Write 


: __fo -1 
(15) v=(‘ 5. 


and take [* = (1) in §6. Let 
Aaa © :. 2 -( at 
s-uvu=(! 1 , H=H,= 1 on , 


for any positive integer m. Then (13) states that S # E, V, U, VU (mod 2) 
which is correct, and mys = 1. Hence 


as n— ~. Thus hy” < 1, and so, by Theorems 3, 5, 6 and 7, 

h(S*, T(2)) = 1, 1 € €(S*, r(2)), 
for all S* € T (1). By taking (i) S* = E, (ii) S* = UVU, (iii) S* = V, we 
deduce that (1) is soluble for every irrational w for an infinity of coprime 
integers a, c in each of the three cases (i) a odd, c even, (ii) a, c both odd, 


(iii) a even, c odd, if and only if h < 1. These results are due to Scott (5), 
who also obtained them by considering I(2). 


7.3. Take T to be [y(2), the group of matrices T congruent to E or V 
modulo 2. A fundamental region for ['y(2) is given by 





| 
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0<x<2, y>9, [ei >1, |s —2| > 1, 
so that Ap = 2. 
As before, take [* = I'(1) and 


_ _, _ (n+1 meet 
S = UVU, H =H; = ( : 4 


for any positive integer m. Then (13) states that S # E, V, U, UV (mod 2), 
which is correct, and mgs = 1. Thus pgs’ ~ 2 as n — @, so that hy” < 2. 
Hence, by Theorems 3, 5, 6 and 7, 


h(S*, Ty(2)) = 2, 2 € € (S*, Ty(2)) 


for all S* € (1). 

By taking S* to be (a) U, (b) E, (c) UVU, we deduce that (1) is soluble for 
irrational w for an infinity of coprime integers a, c if and only if h < 2, in 
each of the following three cases (a) (i) or (ii), (b) (i) or (iii), (c) (ii) or (iii) 
(see §7.2). These results are also due to Scott (5). 

7.4. Let T be the principal (inhomogeneous) congruence group [(N) of 
level N > 2. This is a self-conjugate subgroup of [(1) of index yu(N), say. 
From a set of u(NV) representatives of the cosets of [(NV) in [f'(1) we can choose 
a set Gy of u(N)/N matrices S* with different a, y modulo N and (a, y, N) = 1. 
Let S be such a matrix with 


a=0O(modN), vy = [$(N — 1] 


(integral part) so that (y, V) = 1. Also take * = [(1) and 


_ an _ (Na +1 mens), (2 °) 
n= Hy = ( 1 Nn+1/°\1 1 (mod W). 
Then 
_(1 0 a +) 
w=(} ). H’ U 3, (mod NV) 


and S # H’ U* (mod N), showing that (13) is satisfied. Also, if T € 
ry*ST, c =v (mod N), so that mgs = y. We have, by (14), pgs’ — 1/7? 
as n — ©, and so, by Theorem 7, 


h(S*, T(N)) < {[3(N -— 1))}?* ~ 4" 


for large N and S* € Gy. 

Lower bounds for h(S*, f(N)) may be obtained from Theorem 3, by esti- 
mating hy, for example by (4, Theorem 10), but this does not yield good 
results, at any rate for large N. 

Each S* € Gy yields a result on the approximation to irrational w by 
fractions a/c with a = a*, c = y* (mod N), where a and c need not be positive 
integers. A considerable amount of work has been done by Descombes and 
Poitou, Hartman, Koksma and Tornheim (6) on this and related problems 
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using different methods such as continued fractions. See (6) for references. 
In particular, it is known that h(S*,f(N)) > 4/N® and that h(S*, P(N))~4/N? 
for large N. For N = 3, 4, h(S*, P(N)) = (5/3) and 3, respectively. 
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EQUIVALENT ABELIAN GROUPS 
J. DE GROOT 


1. Introduction. Throughout this note all groups are abelian, written 
additively. We refer to Kurosh (8; 9) for notation, terminology and theorems 
used without reference. We recall the notion of a serving subgroup (or pure 
subgroup) © of a group GW. This is a subgroup © in which for every natural 
number m every equation mx = s, s € S can be solved provided that it can 
be solved in G. If G is torsion-free, “linearly closed’’ subgroups coincide with 
serving subgroups and © is a serving subgroup if and only if G/G@ is torsion- 
free. Direct summands are serving subgroups but, in general, the converse is 
untrue (cf. 4). 

We call the groups @ and § equivalent (or equivalent by subgroups) if each 
is isomorphic to a subgroup of the other, i.e. 

G — 6G @’ C §, 

§$-vH = 9' CG, 

where @ and y are isomorphic maps. If G’ and §’ are serving subgroups or 
direct summands of § and © respectively, we call G and § equivalent by 
serving subgroups or equivalent by direct summands. 

We give a short survey of the main results obtained so far (a ‘‘positive 
answer” means that equivalence implies isomorphism of the groups under 
consideration, otherwise we speak of a ‘‘negative answer’’). 

Equivalence by subgroups. In the main the problem is solved. In general, 
the answer is negative. De Bruijn (orally) and Kaplansky (6) gave a counter- 
example for periodic groups. Here is one for the case of torsion-free groups 
(by a slight alteration the groups can be made countable). 

Example (i). @ is the additive group of real numbers, = @ + €, where 
€ is an infinite cyclic group. § can be embedded isomorphically in G, just as 
any torsion-free group with at most continuously many elements. So @ and § 
are equivalent but clearly not isomorphic. 

Nevertheless, in each of the following cases, the answer is positive: the class 
of complete groups,' the class of groups with a finite number of generators 
(cf. 1), the class of subgroups of the additive group of rationals. 

Equivalence by serving subgroups. In the main the problem is solved. 
In general, the answer is negative. Indeed, in §4, we shall construct a counter- 
example. @ and § are both countable, torsion-free groups, decomposable into 
groups of rank <2. 


(1) 


Received April 24, 1956; in revised form October 24, 1956. 
1A group @ is complete (or divisible) if each equation mx = g, g € G for every natural 
number » has at least one solution in G, that is if n»G =@. 
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However, it is proved in (5), using the results of §2, that for the class of 
completely decomposable groups* the answer is positive. 

Equivalence by direct summands. In the main the problem is unsolved. 
The answer is positive for the class of countable periodic groups (Kaplansky 
(6)) and the class of completely decomposable groups (5). 

In general, the author expects a negative answer in the case of torsion-free 
groups. However, the structure of the torsion-free groups seems not yet 
sufficiently cleared to overcome the difficulties involved. The problem is 
difficult (Massey (11)); for its topological consequences and some other 
positive results, see Yang (13). 

In §3 we construct a simple example of an indecomposable abelian group of 
arbitrary finite rank. 


2. Positive results. 


THEOREM I.* Jf @ and § are equivalent by direct summands (¢@ and yH 
in (1) being direct summands) 1nd if in one of the groups the sum of an ascending 
sequence of direct summands is again a (proper or improper) direct summand, 
then © and § are isomorphic. 


Moreover, it is possible to determine direct decompositions 


(2) G = G, + G, 
(3) 9) _ 9, + H- 
such that the relations 

(4) oG, = 9, 
(5) v'G@. = G2 


hold true and define an isomorphic map of @ onto §. 


Proof. Consider a direct decomposition 


(6) 0) = RK + V9, 
and define @, by 
(7) G, = R+ VoR + YovoRK +.... 


This makes sense, since the property of being a direct summand is transitive. 
VOR is a direct summand of ¥, so 


G = K+ YoR + &, 
and so on. 
Since the sequence of partial sums in (7) are direct summands of G, the 
subgroup @, is a direct summand of @. $G, is a direct summand of oG, G 


2A group, decomposable into groups of rank 1, is called completely decomposable. A group 
is said to be of rank 1, if every finite subset of elements generates a cyclic subgroup (cf. 7). 

*This theorem admits considerable generalizations; its proper formulation will be in terms 
of lattices or semi-lattices. 
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a direct summand of H, so @@, is a direct summand of §. Applying the map 
v, we see that 


(8) ¥OG: = YOR + YOVOKR +... 
is a direct summand of ¥§. Thus there exists a decomposition 
(9) VD = ¥oG, + Gz, 


which defines @, uniquely up to isomorphism. Using (6), we see 
G = KR + ¥oG; + G:. 
It follows from (7) and (8) that 
G, = K + ¥oG,, 
sO we are in accordance with (2). 
Defining §; and §:2 by (4) and (5), we find, applying y¥~ to (9) 


§ = 96+ y'G@: 
9: + D2, 


in accordance with (3). 
Thus (4) and (5) define the required isomorphic map of @ on §, q.e.d. 
We give the following simple application. 


CoROLLARY. Jf @ and § are equivalent complete groups or equivalent 
additive groups of a division-ring, then © and § are isomorphic. 


Proof. Each subgroup U of an additive group of a division-ring with 
characteristic p # 0 is a direct summand, so we can apply Theorem I, if the 
characteristic equals p ¥ 0. In case p = 0, the additive group of a division- 
ring is complete, so we have only to consider equivalent complete groups @ 
and §. Since a subgroup of a complete group is a direct summand if and only 
if it is complete and since the sum of an arbitrary number of complete sub- 
groups is again complete, we can apply Theorem I to obtain the required 
result. 

It has to be noted that this result can also be obtained easily by applying 
the set-theoretical Schréder-Bernstein theorem directly in view of the (up to 
isomorphisms) unique decomposition of a complete group into a direct sum 
of groups of rational numbers and groups of type p* for various prime numbers 
p. 

This corollary can be generalized to more general classes of completely de- 
composable groups, but these are not very interesting since strong conditions 
must be imposed to avoid counter-examples like Example (i). 

The condition in Theorem I requiring that the sum of an ascending sequence 
of direct summands is again a direct summand is also a very strong one. 
It is satisfied, for example, for the class of groups (4) in which every serving 
subgroup is a direct summand, but one can prove that there are already 
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serving subgroups in a free abelian group of infinite rank which are the sums 
of an ascending sequence of direct summands, but nevertheless not direct 
summands themselves. 


THEOREM II. Groups © and §, equivalent by serving subgroups, of which at 
least one has a base, are isomorphic. 


Proof. Since subgroups of a group with a base have themselves a base, 
@ and § are both expressible (up to isomorphisms uniquely) as a direct sum of 
infinite cyclic groups and finite primary cyclic groups. Take decompositions 
of this kind in @ and §. 

@ and § have the same rank, since they are equivalent. Hence the number 
of infinite cyclic summands is the same in both group decompositions. Now 
take the direct sum © of those cyclic summands in @ which are of order 
p* (p and k fixed). We shall show that the number of cyclic summands of this 
type is the same (in the given decompositions of @ and §). © is a direct 
summand of @, ¢@ a serving subgroup of §, ¢S therefore a serving subgroup 
of . Using the well-known fact that a periodic serving subgroup with all 
elements of bounded order is a direct summand, we see that ¢© is a direct 
summand of $. However, since each decomposition of can be refined to a 
decomposition into indecomposable cyclic direct summands, and since any 
two such decompositions are isomorphic, the number of cyclic summands 
in $S, all of order p*, is less than or equal to the number of cyclic summands 
of this type in the decomposition of §, given above. So the number of cyclic 
summands of this type in @ is less than or equal to the corresponding number 
in §. Since the converse is equally true, the number is the same. 

Now the theorem follows by taking direct sums in @ and § corresponding to 
different numbers p*. 


3. Indecomposable groups of finite rank. Let a be a transcendental 
number and put 


a, = a‘ (Sw 3.2. ..+—9h 


We define the group © as the additive group of real numbers 


Jai: a Q ataet+... +a, 
e) — oe ‘tae? aa ’ 
Pi Pp Pn p 
that is, @ is generated by the elements in the right hand side, where the k 
are variable integers and p, pi, po, ... , P» are distinct prime numbers. 


@ is an indecomposable group‘ of rank n (@ is clearly countable and torsion- 
free). 


‘The first indecomposable groups of rank two were constructed by Levi (10) and Pontrjagin 
(12, p. 384). Baer (cf. 7, p. 217) proved that every serving subgroup of the additive group of 
p-adic integers is indecomposable. Erdés (3) gave an example of an indecomposable group of 
rank two, which is essentially the same as our example for the case n = 2, though following 
a different line of thought. 
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Proof. Using the transcendency of a we see that @ only contains elements 
of n + 2 different types, that is, the type 0 (of the infinite cyclic group) and 
the types of a;, do, ..., a, and 


n 
b= _ ai. 
t=1 
Moreover, we see in the same way that only the elements of the serving 


subgroups 


Jay t \ bt 
A, = a ee d $= 
‘pF (1 m) an ps 
have the types of a,(i = 1,2,...,m) and b, respectively. 
Suppose now that @ is decomposable 


G = $+. 


Then each of the groups &%; and B must be contained in either $ or O since’ 
in the opposite case, as the reader may easily verify, both $ and OQ would 
contain elements of, say, the type of a, in contradiction of the fact that 
only the serving subgroup 4%, of rank 1 contains such elements. Say 


bE §. 


The a, are spread over $ and ©, but then they are already contained in §, 
since otherwise 
b=p+4, q # 0, 


in contradiction to b € §. 
Hence 2 = 0 and @ is indecomposable, q.e.d. 


4. Example (ii). Define “‘disjoint’’ groups @,(i = 1, 2, . . .) 
G,~ G, 


where is the group of the preceding section with m = 2. Define 


= {At 


where » is a variable integer. We form the (restricted) direct sums 


2=<> 6, %= >, 
i= i=! 


It is easy to see, using different transcendental numbers, that & can be 
isomorphically embedded in the additive real group. So & is torsion-free and 
countable. 
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R and its subgroup R, are countable torsion-free groups, which are equivalent 
by serving subgroups, but R and R, are not isomorphic. 


Proof. 8; is a direct summand, so certainly a serving subgroup of &. 
Conversely, we can map R,isomorphically in &;: 


by mapping @p in the natural way on §; C @, (where §,; corresponds to the 
subgroup 


of G, defined in §3), and G, in the natural way on G,,; (¢ = 1, 2,...). o& is 
a serving subgroup of &;, since &:/¢8 is torsion-free. Thus Ra nd &; are 
equivalent by serving subgroups. 

To prove that R and §; are not isomorphic, we prove that §, contains no 
direct summand isomorphic to the direct summand @, of &. 

Suppose, on the contrary, that there is a subgroup 


fi 
So* ~ — 
Go ary 
with 
(10) R; = G,* + M. 


Since (putting G, = G) the elements a,/p." are contained in @, for all 
natura! aumbers n, and since “unlimited division” in @,* is only possible 
by powers of /;, it follows easily from the decomposition (10) that all a2/p," 
are contained in J. The b/p" are equally contained in M. So b — az = a, 
and therefore the serving subgroup {a;/p:"} is contained in the direct 
summand J. Hence 


G,C M. 
This is true for all 7 = 1, 2,.... Therefore 


M = Ri 


in contradiction to (10). 


Remark. Actually, we proved the existence of two non-isomorphic groups, 


the first being isomorphic to a direct summand of the second, and the second 
being isomorphic to a serving subgroup of the first. 


| 
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IMBEDDING CONDITIONS FOR HERMITIAN 
AND NORMAL MATRICES 


KY FAN anp GORDON PALL 


1. Imbedding theorems. Let A, B be two square matrices with complex 
coefficients, of respective orders » and m, where n > m. We shall say that B 
is imbeddable in A if there exists a unitary matrix U of order m such that 
U*AU contains B as a principal submatrix. In other words, B is said to be 
imbeddable in A if there exists a matrix V of type m K m such that V*V = J,, 
(= the identity matrix of order m) and V*AV = B. 

For Hermitian matrices, the following result holds: 


THEOREM 1. Let A, B be two Hermitian matrices of respective orders n and m, 
where n > m. Let a1 > a2 >... > a and Bi > B2 >... > Bm be the char- 
acteristic roots of A and B respectively. Then a necessary and sufficient condition 
for B to be imbeddable in A is that inequalities 


(1) a; > Bi On—i+1 S Bm—i41 (1 <1 < Mm) 
be fulfilled. 


The necessity part of Theorem | is well known (2, p. 75). Inequalities (1) 
had already been given by Cauchy (1, p. 187) for real symmetric matrices. 
To the best of our knowledge, no proof of the sufficiency part has been pub- 
lished except for m = 1 or n. Dr. A. J. Hoffman, to whom we described our 
proof in the summer of 1954, kindly sent us an unpublished proof given by 
H. Wielandt in 1953 and based on quite different ideas. 

For normal matrices, we have the following result: 


THEOREM 2. Let A, B be two normal matrices of respective orders n and 
n — 1. Let a, a2,..., @ and B;, B2,..., Bn. be the characteristic roots of A 
and B respectively. Renumber them so that a, a2,..., a, are each distinct from 
Bi, Bo,...,Bg1, while a; = By. forg+1<j<m. Then a necessary and 
sufficient condition for B to be imbeddable in A is that the 2g — 1 points 
1, Q2,... 4 Gg, Bi, Bo,..., Bg—1 im the complex plane shall be distinct, collinear, 
and that every segment on this line limited by two adjacent a;,'s (1 <i < q) 
shall contain one 8,(1 <7 <q — 1). 


This result generalizes the case m = n — 1.of Theorem 1. It may appear 
surprising that a generalization to normal matrices is possible. For one thing, 
the roots are now complex, and since complex numbers have no natural 
ordering, inequalities (1) might not be extendible. 

Received July 6, 1956. Work prepared in part under a National Bureau of Standards 
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We observe that the essential result in Theorem 1 is the case m = n — 1, 
in which case (1) becomes 


(2) a> Bi > a2 D>... D An-1 D Bn-1 D O&.- 


The result for this case is easily extended to the case m < nm — 1. In fact, if 
(1) holds, intermediary sequences of characteristic roots can be inserted such 
that two consecutive sequences are interlaced similarly to (2). Then, by the 
result for m = n — 1, there exists a chain of Hermitian matrices, with orders 
increasing by unity, such that each is imbeddable in the next. 

For normal matrices, Theorem 2 deals only with the case m = n — 1. 
The case m < nm — 1 seems to involve additional complications. If a normal 
matrix B of order m < n — 1 is imbeddable in a normal matrix A of order n, 
it is not true in general that there exists a chain of normal matrices, beginning 
with B and ending with A, with orders increasing by unity, such that each is 
imbeddable in the next. As an example, let A, B be diagonal matrices: 


5 + 2it 
A= diag {0, 1,14,1+ 1}, B= diag" : +S - ams d 


If we take 


2 
,_—_1]1 -2], 
/10] 2 -1 

s 1 


then V*V = J, and V*AV = B; so B is imbeddable in A. By Theorem 2, 
any normal matrix C of order 3 imbeddable in A must have two of 0, 1, 4, 
+ i as characteristic roots and its third root on the segment joining the 
remaining two. But by Theorem 2, B cannot be imbedded in any such matrix C. 
Because of interest in extensions of Witt’s theorem on quadratic forms in a 
field, the following corollary is worth noting: If a normal matrix with char- 


acteristic roots a), 82, ..., 8,1 can be imbedded in one with roots aj, ae, ... , Gn; 
then the same is true of the respective matrices with roots 2,..., 8,1 and 
@2,...,@,. Another interesting fact is as follows: A necessary and sufficient 


condition that all matrices imbeddable in a normal matrix A be normal 
that the characteristic roots of A shall be collinear, or (what is the same 
thing) that A have the form w(J + e“H) with H Hermitian, w a complex 
number and @ real. The sufficiency is evident: 


V*(I + e*H)V = I, + eK, 


as K = V*HV is Hermitian. To prove the necessity notice that if N is 
normal then w,;N + weJ is normal (w; and w2 complex numbers). Hence it 
suffices to consider A = diag{0, 1, a + bi}, b # 0. But if 


k kk 
>. a 
silts } —h c}. . 
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the matrix 
3(1+a+ di) —hk 
a * a 
B= V*AV -| _ hk , | 


is not normal. 


2. Proof of the sufficiency part of Theorem 1. According to a remark 
made above it suffices to consider m = n — 1. Since the properties in question 
are invariant under unitary transformations we need only prove the following: 
If real numbers a,(1 < i < m) and 8,(1 <j < m — 1) satisfy (2), then there 
exist m — 1 complex numbers z,(1 < i < m — 1) and a real number y such 
that the Hermitian matrix 


Py 21 22 23 oo « Syeh 7 

2, 8 O O a 

2 O 6 0 ...0 

(3) » © © & 8 
LL fn—1 0 0 0 a al 








has {a,} as characteristic roots. To show this, we may assume 
(4) ark reas eo... 1 > & 


instead of (2). In fact, if 8B, = a, or a;41, we can choose z; = 0 and then work 
with a matrix of order decreased by unity. Since the characteristic polynomial 
of the matrix (3) is 

J > lz,[° ai 

a- Go _ (A — B;), 

\ j=l A- B,S j=l , 
and in view of the strict inequalities (4), the requirement that {a,} shall be the 
characteristic roots of the matrix (3) is equivalent to 


n—l 


1+ 7: aa. = a; (l<i<n). 


jul Ai B 


Hence it remains to prove the following 
LemMA. Under the hypothesis (4), the system of n linear equations 


Xi Xe Xn—1 
+...4¢— = 


(5) an-h' anh a, — Br 





+x, = a; (1 <qig¢n) 


has a unique solution x, X2,...,X,, and this solution satisfies x, > 0 
(l<k<n-—1). 


Proof. Let A denote the determinant of the coefficient matrix of the system 
(5). Let A, denote the determinant obtained from A by replacing the kth 
column by the column of quantities a; Then we are to prove that A + 0 
and that A,/A is positive for 1 << k <n — 1. 


hm 


rk 
ial 


he 


n ) 
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This will be seen from certain expressions identically equal to A or A,. 
In deriving these expressions we shall regard the a’s and §’s as variables. 
First we write 
(6) 4 = P/Q, 
where 


(7) Q=T[] IT] @-48,), 


l<i<n 14j<n-1 


and where P is a polynomial in the 2m — 1 variables a, and 8,. Since two rows, 
or two columns, of A become equal when two a’s, or two §’s, are set equal, 
every difference a; — a; (l << i<j <m) and 6,— 8, (l<i<j<¢n-—1) 
is a factor of P. The number of these factors being 


(+65 ))=6—m 


P being homogeneous of degree (m — 1)*, we must have 


P2x¢c I] (a; — a;)° I] (8; — B,), 


1< 1< jen 1< i<j<n-1 


where c is a constant. To determine c we may choose 
B,=j(l<j<en—-1), ag =ite(lCign—1), a =0. 


After this substitution, if we multiply by « each of the first » — 1 columns of 
A, then from the complete expansion of the determinant it is easily seen that 
e"—1A4 — 1 as e — 0. In the polynomial P in ¢, the constant term is 
° \2 
cm—1)! JT] @-f)’. 
1< i<j<n-1 


Also, we observe that e*! is a factor of the polynomial Q in e, and that the 
constant term in the polynomial Q/e"~' is 
(n—1) . 2 
(—1r"""@-1)! Tl &—3. 
1< i<j<n-1 


So we have e*! P/Q — c(—1)'-»", as «> 0. Hence c = (—1)§*—*" and 


(8) p=(-1)"™ |] (@-a,): [] (6-86). 
1< 1<j<n 1<t< j<n—-1 
Similarly, we write 
(9) A, = P,/Q, (l1<k<&n-—1). 
where 


(10) @2=I[] I] @-s,, 


le ign 14j<n-1 


jxtk 
and 
Pr = I] (ay — ay) ° I] (8, — B;). 
1<€ 1< j<n 1<1<j<n—-1 
i, joth 


To determine the constant c,, we use the same substitution as above. If we 
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multiply by « each column of A, except the kth and nth, then we get e*-*A,—k, 
as «— 0. One also verifies that 


e*-? P,/Q, — ke, (—1)**-9, e— 0. 
So we have 
(11) P, = (-1)*"* TT (a,—a,)- JT] (8-8). 
1g t< jen 1<1<j<n—1 
1, jth 


From (6), (7), (8) and (4), it is clear that A # 0. Collecting our results 
we obtain, after simplification, 


(12) A/ds=— [] lav— fe) / JI (8; — Bx) (l<ok<n-—1), 
1<j<n—1 
jak 


1<i<n 
which is positive, by (4). This completes the proof. 


From the above proof, it is clear that in the case of real symmetric matrices 
A, B, Theorem 1 remains valid if we require U to be orthogonal in our definition 
of imbedding. 


3. Proof of Theorem 2. We first prove the necessity part. Let A, B 
be two normal matrices of orders m and m — 1 respectively. Let a; (1 <i < x) 
be the characteristic roots of A, and let 6, (1 <7 < m — 1) be those of B. 
Assume that B is imbeddable in A. Then there exists a unitary matrix UL’ of 
order n such that U*AU is of the form 








Ty 21 Z2 23 oo « By] 7 

wy By 0 0 — 

D> 0 Be O a 

(13) U*AU = Ds 0 0 Bs 5 ata 
L Wn—1 0 0 0 = a 


The fact that U*A U is normal can be expressed by 


(14) lz, = |w,| (l<j<n-1), 
(15) 2; 2, = Wy; Oy (lej<en-11l<¢k<n-1l), 
(16) (8, — y) wy = (8B; — ¥) 2; (l<j<n-—1). 


From (15), we have (2, W,) (2 2%) = (2, d,)(w,w,), so among the n — 1 
numbers 2,0, (1 < 7 < m — 1), the non-zero ones have the same amplitude. 
Designating this common amplitude by 20, we have by (14): 


(17) 2, = ef" w, (<j<n-—1). 


We may assume that 2,, 2,41, ..., 2,-1 are all those z,’s which are zero (if 
they exist). If 1 < 7 <q — 1, then by (16), either 8, — y = 0, or 


amp(8; — y) = @ (mod z). 


0. 
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In either case, we can set 


(18) B,= 7+ eb, (l<j<q- 1), 
where 5, is real. Put 
(19) u, =e", (l<j<q-1). 
Then by (17), 
(20) i, =e" w, (l<j<q- 1). 
Introduce the Hermitian matrix of order g 

0 a a) ee 

a, 6b 0 O — 

ee i oe 
and let 
(22) C=y7I,+ e8H, D = diag{B,, Beri, ..., Bp—1}. 


Then according to (18), (19) and (20), we have 


‘ * = 


Let a; > d2 >... > a, be the characteristic roots of the Hermitian matrix J. 
Then by (22) and (23), the characteristic roots {a;} of A (also of U*AU) 
can be so renumbered that 


ja;=yte“a, (1 « 


(24) SI<S9) 
la, = By (q+1<j<ny). 

Now we renumber (i, §2,...,8,-1 in such a way that the corresponding 

bi, be,...,bg-1 (see (18)) are arranged in decreasing order. Then, since 

@;, G2,...,@, are the characteristic roots of H, we have the interlacing 

inequalities 

(25) a1>bh >aza >be >... D> dyit > ay. 


Relations (18), (24) and (25) together express precisely the condition stated 
in the theorem. 

To prove the sufficiency, let complex numbers a, (1 <i <m) and 8, 
(1 <7 <m — 1) satisfy the condition stated in the theorem. Then for some 
complex number y and real numbers 


6; @, G2,... 5 Ag; b,, be 


we have the relations (18), (24) and (25). From (25) and Theorem 1, there 
exists a Hermitian matrix H of order g, with characteristic roots a), de, ... , ay, 
and containing diag{),, be,...,6,-1} as a principal submatrix. Using this 
and the numbers y, 6 appearing in (18), (24), we form the normal matrix 


C = 71, + eH. 
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Let D = diag {8,, Bo+t, oeeg f..23. Then 


[o 


is a normal matrix of order m, and by (24), its characteristic roots are 
@1, @2,..., &. Since H contains diag{b,, be, ... , bs-1} as a principal submatrix, 


so by (18), 
Cc 0 
0 D 


contains diag {8,, 62, ..., 8,1} as a principal submatrix. 

This shows that every normal matrix of order m — 1 with characteristic 
roots 8, (1 <j <m— 1) is imbeddable in any normal matrix of order n 
with characteristic roots a; (1 < i <n). 
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INEQUALITIES FOR SYMMETRIC FUNCTIONS 
AND HERMITIAN MATRICES 


M. MARCUS anp L. LOPES 


1. Introduction. The purpose of this paper is to present two concavity 
results for symmetric functions and apply these to obtain inequalities con- 
necting the characteristic roots of the non-negative Hermitian (n.n.h.) matrices 
A, Band A + B. 

H. F. Bohnenblust has recently communicated to one of the present authors 
a suggestion of H. Samelson indicating that Theorem 2 below can be derived 
from a concavity result on mixed volumes of convex bodies due to Fenchel 
(2). Bohnenblust has also obtained a separate proof of Theorem 2 by different 
methods and we are indebted to him for several valuable conversations 
concerning his results. Theorem 1 in the sequel is believed to be new and we 
show that it actually provides a simple and direct algebraic proof of Theorem 2 
without using the results on mixed volumes. We have not been able to find an 
explicit statement of Theorem 2 in the literature. Both the present proof 
and the unpublished proof of Bohnenblust were constructed before Samelson 
observed the connection with Fenchel’s result. 

In §2 we state and prove the two results on symmetric functions. In §3 
these results are applied to obtain inequalities connecting the characteristic 
roots of n.n.h. matrices. Theorem 4 extends the classical Minkowski 
determinant inequality to all of the coefficients in the characteristic polynomial 
and Theorem 5 extends a recent concavity result of Fan (1) for determinants. 


2. Results on Symmetric Functions. Let (a) = (a:,...,4a,) bea set 
of real numbers and let E,(a) denote the rth elementary symmetric function 
of (a), l<r<n: 

r 
Efay= 2 IIa, 


1<ii<...<iregn jel 
where the summation extends over all (7) choices of i; <...< i, from 
1,...,#.Ilfr = Othen E,(a) = 1. If (a) = (a;,...,a,) and (6) = (b;,...,5,) 
then let (a + 5b) = (a; + b1,...,a, + 5,). We confine our attention here to 
sets (a) of non-negative numbers at least r of whose elements are assumed 
positive. If there exists a number X such that 


a,;=db,, [i 2 eer, 


we shall write (a) ~ (b). If this is not the case we indicate it by (a) ~ (6). 


Received May 14, 1956. Part of this research was completed under U.S. Air Force contract 
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The set a), ... , @j-1, @j41,... , @» will be denoted by (a’,). We shall also use 
the abbreviation 


THEOREM 1. /f 1 <r <n then 
E,(a + 5b) > E-(a) + E,(bd) 
unless (a) ~ (b) or r = 1 1m which cases we have equality. 


Proof. We use an induction on r. If (a) ~ (6) we clearly have equality, 
hence assume (a) ~ (b). For r = 2 the result follows from 


¥ (a3 4,-1.0) 
E.(a + b) — E,(a) — E.(6) = —“-— a, 


220 (ar +b) Dard bi 


Hence assume r > 2. We first note that 





(1) > a, E,-1 (a’;) = r E,{a). 


i=1 
Now 
E,(a) = a, E,_; (a’;) + E; (a’,), 
and summing on 7 we have 


nE,(a) = > a, E,-1(a',) + > E,(a’,). 
i=1 


i=l 


Using (1) it follows that 


(2) >» E,(a’;) = (n — r) E,(a). 
t=1 
Now write 


E, (a) — E, (a’;) = a; E,-, (a’;) 
= a,E,; (a) — a Ey (a’;) 


summing over i and using (2) we have 





rE,(a) = >) a, E,-.(a) — > a E,-2 (a’,), 
t=i i=1 
Ea) Ue “ »E,-2(a’,)\ 
3 —t = — ant at S582 
(3) E,-1(a) 2% dai E,-1(a) § 


-4da- Soe 


PT tal i=1 a,+ E,:(a’,)! i 





Vv 


use 
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From (3) we calculate that 
@ = E,(a + b) — E,(a) — E,(6) 
1s) a bi __ a + by 


? j=l lon F Ee’) “s5 + E,. 1(b’;) ~ a, +b, +E, ia’, +8)5° 
We make the induction hypothesis that the inequality holds for r — 1. Since 
r > 3 we have 
E,-:(a’; + b’ ,) > E,-1 (a’ ,) + E,- 1 (B’ 5) 
unless (a’,;) ~ (b’,) in which case we have equality. We first consider the case 
for which there exists an i such that (a’,) ~ (b’,); then it follows that 








~ f a’ Bt 
o>72 ry 1(a’;) oS b, 1 + E,. 1(0's) 
(4) = (a, + b,)° = { 
a, + 6, + E,1(0's) + E,a(0')! 
1< (a, E,_ -1(b';) — b, E,_:(a’,)) F 








ss re (a, + E,1(a’,)) (6, + E,- CAG + b, + E,1(a’,) + E,1(0’,)) 


Now in case (a’,) ~ (0’,) for each i then the first inequality in (4) is equality. 
To resolve this case suppose 


(a’s) = A; (0',) 
for each i. Then 
(a, E,1(b's) — by E,1(0's))* = (as — py)’ wi 
where 


ao, = E,_:(0’ ;) aa 0 
since at least r of the (}) are positive. But on the other hand a; = A, 5; implies 
(a) ~ (6) contrary to assumption. 
Hence in this case the second inequality in (4) is strict and the induction 
is complete. 
THEOREM 2. If (a) and (b) are sets of n positive variables then for1 <r <n 
(5) E}!" (a+b) > E;” (a) + E;” (6) 
unless (a) ~ (b) or r = 1, in which cases we have equality. 
To prove (5) we first establish a preliminary 
LemMA. If (a’,) ~ (b’,) and (a) ~ (b) then the inequality (5) holds. 
Proof.' Set 
ta, = b,, ¢=mi1,..., n— 1 
(t+ dja, = b,, 60. 


1The authors are indebted to K. Hoechsmann for simplifying the original proof of this 
Lemma. 
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From 
E,(b) = E,(b’,) + b, E,-1(b’,) 


we conclude that 


E,(b) = t’ E,(a',) + (¢ + 6)t’—' a, E,-1(a’,) 
= t’ E,(a) + 6 t’-' a, E,-1(a’,). 
Set 

g(6) = E}” (a+ 6) — E,'” (a) — E}” (8). 

g(8) = ((¢+ 1)’ E,(a) + 6(¢ + 1)" a, E,-1(a’,))” 


“7r—1 


— E,"" (a) — (t' E,(a) + at’ a, E,-1(a’,))"”’. 
Since g(0) = 0 it will suffice to establish 


5g’ (6) > 0 


for 5 0, for then g strictly increases from 6 = 0. First note that 





r(E,(b) E,(a + b))"-” 


The sign of this expression is the same as the sign of 


(8) ~ a, E,-1(an) (¢ + nee Eer (b) - Co . (a + b) 


(¢+ 1)’ E,(b) — t’ E,(a + 6) = (t+ 1)’ E,(a) + bt’ a,E,_1(a,) 
t’ (¢+ 1)’ E,(a) + 6(t + 1)""a,E,_1(a,) 
t’—"(t + 1)" 6 a, E,_, (a,). 


Hence g’ (6) has the same sign as 6. 


The proof of (5) will be done by a double induction on r and n. The case 
r = 2 and » arbitrary can easily be verified directly. We thus make the 


induction hypotheses that (5) holds for 


(i) r — landalln >r-—1, 
(ii) r and m — 1. 


Since we know r = 2 and r = n (Hdélder’s inequality) it will suffice to prove 


(5) for r, m using (i) and (ii). 
By the Lemma we can assume (a’,) ~ (b’,) and also (a) ~ (b). Set 
=a, yy =b, x= Rceosd, y= Rsing. 
Let f(x, y) = g(R, @) be the left side of (5) minus the right side: 


(6) g(R, ¢) = (E,-1(a’, + b’,)R? + E, (a', + b’,))'"”" 
(E,_1(a’,)R? cos? @ + E, (a’,))!/" 
(E,_:(b’,)R? sin? @ + E, (b’,))'”’. 


| 


We show first that there exists a sufficiently large Ry such that 


g(R, ¢) >0 





for 


By 


At 


we 


Re 


an 


lo! 
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for 0 < @ < 2x and R > Ro». Set 
C,=E,(a@,+ 0’), 
A,=E;(a’,), 
B, = E, (b’,). 


By factoring an R? from each bracketed quantity in (6) it is clear that 
g(R, ¢) > 0 


if and only if 


c,\'”" wre” s,, B\” 
(c,. + c:) _ (1 ,-1cos $+ 4:) - (s... sin’ @¢ + B;) > 0. 
At @ = 0 and ¢ = 2r this is clear. By setting 
h’ (¢) = 0 

we easily calculate that at a minimum for h(¢@) 

cos’ 6 @ Aen "hen + Bey 

sin’ @ = By) (AY + Ba”). 
Replacing these values in h(@) and using (i) we see that 


h(¢)>0 


and hence 


f(x,y) = g(R, @) > 0 
for R > Ro. Thus if f(x, y) is to have a non-positive minimum it must occur 
in the interior of the circle R = Ry where the partials satisfy 
(7) 
The equations (7) become 
xf (x? + y?)Cia + C,) 9°?" Cri — (2*A a + A) ?""A,-1} = 0 
yi C(x? + y*)Cua + °°?" Cua — (9°B pa + B,)"” Bp} = 0. 


lfx = y = 0 we have the result by (ii). If both the curly bracketed quantities 
in (8) vanish then the result follows easily from (i) by direct substitution. 
The less trivial case is, say, y = 0 and x # 0. We must then show 


(8) 


f(x,0) >0 
for the value x given by 
1(r—1) A ine—y _C 
cle _ A ans r 
‘ 1 1 r—1 
= —70-) _ are 
pk i oe 
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Replacing this in f(x, 0) we have 


f(x,0) = 
l/r l/r 
1/(r—1) (r—1) A /(r—1 Cc 1 —1) 
cc = oS _ a = —A,A4 
41 r—1 r—1 l/r 
I-11) OF 17-1 —\—— 7AYG-) {,Vve-p — B, 
c 1 1 
r—1 — 4ir—1 r—1 — 4ir—1 
l/r 1/ 
C, A, LC, A, ; 
= Ci-» or Ay1 i gtr _ C,-1 —_ Art Bi” 
_ r—1 17(r—T) 17(r—1 — 42 1 17(r—T) i7(r—-1) jr - 
C— a A r—1 [ Ct —A r—1 


At this. point we use the inequality of Theorem 1. Then 





B, ad 1/(r—1) 1/(r—1) \(r—1) fr l/r 
f(x, 0) > B (C,4 -A r—1 ) = B, > 0. 
r—1 


The last inequality follows from (i). This completes the induction. 

To indicate briefly that the concavity of E,'/"(a) can be obtained from 
Fenchel’s result we set K,; equal to the segment from 0 to 1 on the ith axis 
(¢ = 1,...,m) and we let Q be the unit cube in n-space. For a = (a;,..., dn), 
a; > 0, form the convex sum 

n 
K, = >. a, K,. 


i=1 
For A, » non-negative numbers form the volume 
n 
VAKe+uQ) = 2) b,0% w’. 
j=90 
The coefficients b, are the elementary symmetric functions of (a) to within 
a constant factor and are called the mixed volumes. It is easy to check that the 


concavity result of Fenchel? reduces in this case to the statement that E,'/"(a) 
is concave. 


3. Applications to Matrices. The following result is known (3; 4): Let 


Figs, ..-,He) @ By (Aes, Sa), --: (Axx, X,)), 
where A is a n.n.h. matrix with characteristic roots 0 < a; Kawi (@ = 1,..., 
n — 1). Then 
max f = (*\(> as-is) 
Tr j=l k 
min f = E,(a1,..., a) 


where both max and min are taken over all sets of k o.n. vectors in the unitary 
n-space.* 





2A discussion of mixed volumes and a conjecture of this general concavity result may be 
found in the book by T. Bonnesen and W. Fenchel, Theorie der konvexen Kérper (Chelsea, 
New York, 1948), p. 93. 

’The paper of Ostrowski contains only the value of min f 
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THEOREM 3. Let A,B,A + B beu.n.h. with characteristic rootsO < a; < aias, 


0< Bi < Bir, OC Ai < Aggi for 2 


l<r<gkaqn 


Ee" (vs, .. 


Proof. Let 
such that 


Set 


Then 


E}/"() = E}’"(c) = 


(Ax,;,X,;) = 


a;, 


sl/r 


= Le 


> E}’"(a) + E)” 


(Bx ;, x3) = b;. 


(a + b) 
(b) > E’"(a) 


x" + >> p,(A) x”? 
j=l 


Whiecey n— 1, respectively. Then for 
-» XD > Ey" (ea gees ay) + E;"" (Bi peeves Bx) 
See, x, be an o.n. set of characteristic vectors of A + B 
cy = ((A + B) xy, x5) =X, oe ae k 


+ E}!"(g). 


be the characteristic polynomial of A. By setting k = m in Theorem 3 we 
obtain the following extension of the Minkowski inequality to all the coeffic- 


ients in the characteristic polynomial: 


THEOREM 4. 


If A and B are n.n.h. then forl cr <n 
lp, (A + B)|'/" > |p, (A)|'”” + |p, (B)|'””. 


The extension of Fan’s result is contained in 


THEOREM 5. 
[“arae 


If A and B are n.n.h. and w + ¢ = 


p-(wA + oB)| > |p,(A)|* |p-(B)|". 


Proof. As in the proof of Theorem 3 


l, w > 0, o > O, then for 


E}!"(d) = E}!"(w(a) + o(b)) > wE)!" (a) + cE!" (b) 
> wk}! (a) + cE}! (8) > Et!" (a) Et’"(8). 


The result follows by taking rth powers. 


THEOREM 6. 


(9) 


Under the same hypotheses as Theorem 3, 


l—r k 
ae, Sell 
> E, (a 


+ E,(B; 


) E,(A1,..-» Az) 
k l-, 
rn b> — 


grees 


fui 
k 


(> | ae 
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Proof. We again choose x;,...,x, to be Rk o.n. characteristic vectors of 
A + B corresponding respectively to the characteristic roots A;, . . . , Ax. Then 
((A + B) x, x,) =, ee 


and we have 


E,(A1,..-, Ax) = E,(((A + B)x, x)) = E,((Ax, x) + (Bx, x)) 
> E,((Ax, x)) + E,((Bx, x)) 


k ) ( k a a 
~ an— Jt 
> B,(a)(, -1) \a5 
, k ( k Bp us) r 
+ E,()( id ) > ; 


COROLLARY 1. Under the same hypotheses as Theorem 4 
-+( ” \|_p(A +B) | 5 _|o(A)] , _lor(B)| 
- (" ) 24, + B) ? (tr A)’ + (tr B)’ 


Proof. Set k =n in (9). We immediately conclude from Corollary | 
that 


and (9) follows. 


n -1 
det (A + B) > tr (A + By( ee» Il ( n ) +) 


r=2 \T — ] 
where 
= _|p-(A)| _ler(B)| 
= (tr A) + (tr By 
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ON THE MAXIMUM PRINCIPLE OF KY FAN 
M. MARCUS anv B. N. MOYLS 


1. Introduction. In 1951 Fan (1) proved the following interesting extreme 
value result: Let Ai,..., Am be completely continuous operators on a Hilbert 
space . For o = 1,2,...,m let doi > Ao.i41 be the characteristic roots of 
A,* A,. Then, for any positive integer k, 


k k m j 
(1) max > (U,A1... UgAu Xi, %1)| = ) (11 si) . 


i=l o=l 
k m 
(2) max det{(U:A1... UnAnX;,%;)} |" = I] TT Res 
i, jml,2 k i=l om 
where both maxima are taken over all unitary operators U,,..., U, and all sets 
of k orthonormal (o0.n.) vectors. 
Fan proved (1) for m = 1 and then applied an inequality of Pélya (7) 


and a recent result of Horn (3) to obtain the theorem for m > 2. This result 
is a generalization of a result of von Neumann (10), which states that 7/ 
A and B are n-square complex matrices with singular values (12) a; > ais, 
8, > Bir (¢ = 1,2,...,” — 1), then 


(3) max|tr(UAVB)| = > a; Bi, 


i=1 
where the maximum is taken over all unitary U and V. 

In this paper we shall confine our attention to the case of finite matrices 
We shall show that both (1) and (2) are special cases of a general maximum 
result for compound operators (Theorem 3). As applications we obtain 
inequalities analogous to those of Horn (3) and Ostrowski on the singular 
values of a product. An inequality of S. N. Roy (8) (later published with a 
different proof by B. Sz. Nagy (9)) is a special case of Theorem 3 and Ostrowski's 
inequalities (6) connecting Schur-convex functions of singular values and 
characteristic roots. 


2. Fan’s first result. Before proceeding, we point out that for finite 
matrices (1) follows immediately from (3). Following Fan we need only prove 
(1) in the case m = 1. 

Let x;,...,x, be an o.n. set of vectors, k < n, and let P denote the ortho- 
gonal projection into the subspace spanned by x;,.. . , X;. Then 
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313 











314 M. MARCUS AND B. N. MOYLS 


‘ k 
| Do (UiA1 %4, X41) | = | 24 (UiAr xy Pxs)| 
| i=l ' ry 
= | >> (PUjA1 x4, x1) 
i=1 


tr(PU;A)) 


max |tr(VPU,A1) 
Uu1.V 


< , Pi Vv Kis 


i=l 


A 


by (3), where the p, are the singular values of P. Since p, = 1 forj = 1,..., k, 
and p,; = 0 for 7 > k + 1, (1) follows for m = 1. 


3. Properties of the Grassmann algebra. Let x,(i=1,..., r) l<r<n 
be vectors in unitary m-space V,. Then 


(4) Se GH, ABe A .-+ ABs 


denotes the Grassmann exterior product of the x,; it is a vector in V, where 
m = (;). If A is a linear transformation of V,, the rth induced compound of 
A is a linear transformation of V,,, defined by 


(5) CLA A... ASe @ AMA... A ade 


The following properties, which we list for later reference, can be found in 

(11, pp. 63-67): 
(i) Gr A... AX Wi A.~-- A Ve) = det { (x4, V5) } 4, 501.....7- 

(ii) C,(AB) = C,(A) C,(B), C,(A-") = C,-“"(A), C,(A*) = [C,(A)]*, 
where A* is the transposed conjugate of A. 

(iii) The characteristic roots of C,(A) are the (7) products of the character- 
istic roots of A taken r at a time. 

(iv) If A has any of the following properties, so does C,(A): non-singular, 
normal, Hermitian, non-negative Hermitian (n.n.h.), unitary. 

To prune some of the foliage of indices usually necessary in discussing 
these objects we introduce some notation. The set of (+) distinct sequences of 


positive integers i,,..., 1%, satisfying 1 < i; <... < i, < k will be denoted 
by Q;,, and a typical such sequence will be called w. If x:,..., x, is a set of k 
vectors in V,, and w is the set {7;,..., i,} in Qy,, we set 

Xe = X%i,N..- AX; 
If (a) is a set of m numbers a,,..., a,, E-(a) = E,(a;,..., a,) will denote 


the rth elementary symmetric function of (a). 


4. Results.The following theorem is a generalization of the result in (5): 


THEOREM 1. Let H be n.n.h. with characteristic roots hy > hj, j = 1, 
n — 1. Let f(t) = t', s real, and 































THE MAXIMUM PRINCIPLE OF KY FAN 





o(x1,..-,%) = D, Sl(C-(H)xe, xe)), 


} @tQir 
where 1 <r<k <n. Then, tf s > 1, 


(6) max @ = E,[f(h),..., (hy) |; 
} . ‘. 
and ifO <s <1, 
(7) min @ = E, [f(ha—e41), ~~~ » f(a) I; 
where the maximum and minimum are taken over all sets of k on. vectors 
tin ss (tee ve 
b Proof. For the case s = 1, the theorem is proved in (5). The present proof 
_ is an application of this result to f(77). 
PI 
First note that! if ||x/| = 1, 
<n \ (8 {| (Hx, x)| < (f(A) x, x). 
| Ror, t8 Sicccced x, are o.n. characteristic vectors of H corresponding to 
iS cone e h,, then 
' f| (Hx, x)] = | h, \(x, x | 
ere fl | =, > j i) 
| of ™ « . 
< > fay) (x, x,)|° 
‘ j=l 
= (f(H)x,x 
; Secondly,” 
- (y f(C,(H)) = C,(f()). 
, For, if U is a unitary matrix such that U*HU = D, diagonal, 
(C,H) = f(C,(UDU*)) = C,(U) f(C,(D)) C(U* 
= C,(U) C, (f(D)) C,(U*) = C, f(UDU*)) = C,f(a 
er- lo establish (6) we observe that by (8), (9) and the remark at the beginning 
of the proof, 
so ' &< > (f(C(A)) xe, %) = D> (C,(f(H)) Xe, Xe 
j @tQkr weQkr 
ng < E,(f(in), ...,f(he)). 
of Equality can be achieved by choosing x;, . . . , x, to be an o.n. set of character- 
ed istic vectors of H corresponding to hy, ... , h,, since by (i), 
i 
= >> f(det{ (Hx,,, x;,)}), ste 1,...57, 
} @tQkr ‘ 
= D> fidet{hi, 644)) = D ATI i.) 
te weteQkr weQe- s= 1 
—_— = E,(f(hi),..., f(y). 


| . . . 
(7) is proved similarly: 
' —_ . 
'This inequality holds for any continuous convex function f defined on the spectrum of H 
; rhe inequality is reversed if f is concave. 


*This equality holds for any function f defined on the spectrum of H and satisfying the 
relation f(xy) = f(x) f(y). 
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Theorem 3 is our main result. Theorem 2 is a special case. However, it is 
the first step in the proof of the general theorem, and its proof appears to be 
of some interest in itself. 


THEOREM 2. Let A be an arbitrary n-square complex matrix with singular 
values a; > aiui(t=1,...," — 1). Let 


o(x1,...,%;U) = > (C,(UA ) Xu, Xe) 


@tQkr 
wherel<r<k <n. Then 
(10) max @ = E,(a;,..., Ox), 
where the maximum is taken over all sets of k o.n. vectors x\,...,3 x, and all 


unitary U. 

Proof. Since A = VH, where V is unitary and H = (A*A)?! is n.n.h., we 
may without loss in generality replace A by H. Set y, = U-'x,, (j = 1,...,h). 
By the notation © @C,(A) we mean the direct sum of C,(A) taken (*) times; 
and similarly for >> ®@x,. Since > ®@C,(H) is n.n.h. by (iv), 

¢ = | Dd (ve(M)x%e, Cu) xe) 


| weQkr 


(x @®C(M) > @x, > Ox) 


A 


(=< @cihH>dY ex, © Ox)! 


(LD @CKH) DL Oy Le Oy)" 
} } 
| > (C,(H) Xe, x) | | = (CED ym 36) | 


#tQkr @tQks 


< E,(a, ... , a). 


The last inequality follows from (6) applied to each of the square roots. 
Equality holds when the x, are suitable eigenvectors of H and U is the 
identity matrix. 


THEOREM 3. Let Ai,..., A, be arbitrary n-square complex matrices with 
singular values ae;, where ae; > Mi41 (1=1,..., 2-1; e¢21,..., m). 
Let 


o(x1,...,%23Us,..., Un) = | D> (C(U1A1.. . Um Am) Xs Xe) |; 


@¢Qkr 


where 1 <r<k <n. Then 


(11) max @ = eA TI] Ghass es fia.). 
1 


where the maximum is taken over all sets of k o.n. vectors x, 


oe x,, and all 
unitary matrices U,,..., ws 


oe 





ve 


2). 
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Proof. For a fixed U2,..., Um, let Bi > B2 >... > 8, be the singular 
values of B = A,(U2A2)... (UmAm). By Theorem 2 
(12) @ < E, (81,..., Bx). 
The singular values of U,A, are the a,, (i = 1,...,m). By a result of 


Ostrowski (6, p. 283, equation 106a), 


E,(B:, a | Bx) < ra I] Gely +++ I] aa) . 


There remains to show that the right side of (11) is taken on for a suitable 
choice of the x’s and U’s. As in Theorem 2 we may replace each A, by 


i 
H, = (A,* A,)?. Let %e1,...,Xen (6 = 1,...,m) be an o.n. set of character- 
istic vectors of H, corresponding to ae, .. . , @en. Choose the U, so that 
Us %ei = Xe-1. Res 


U; Mii = Xmty 


ff ae n. Set 
Xow = Xety Rese Xet, 
and 
r 
Aw = I] Ae 4; 
j=l 
where w is the set {2),...,7,}. Then 


| > (CU. . . Un m) Xe» Xe) 


@tQér 
= | > Ome C.(UiWA, eee a Xm—1.09 Xen) 


= > Chass + oes Med 


m m 
=Z (Ti Agi ess I] sie) 
. "i 
= BA Th enn... I] an) 
c= o- 


This completes the proof of the theorem. 


Remarks. 1. By setting r = 1 in (11) we obtain (1); by setting r = k we 
obtain (2). 

II. Theorem 3 does not follow immediately from (1) applied to the 
compound because the-lexicographic ordering of the eigenvalues does not 
necessarily correspond to the ordering by magnitude. 

III. Let A,, o=1,...,m, have eigenvalues },;, ordered so that 
Neil > |Ac.ig:1] (2 = 1,...,"— 1). We can find unitary V, such that 
V,* A.V. = T,, triangular, with the diagonal elements ordered by absolute 
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magnitude. Then if U, = V.1 V.* (6 = 2,...,m), Ui = V, UVi* where U 
is the diagonal matrix with 
exp (- > arg rs) 
o=! 


in the ith row and ith column; and if x, = V,e;(j = 1,..., k), where e 
is the unit vector with 1 in the 7th position; then 


>> (C.(Ui1A1.. . UmAm) Xe, Xe) | = eA T] oe | ral). 
= =) 


'weQer 


It follows from Theorem 3 that 


(13) e( TI ahs» TY hel) < eA TI —— aa) } 


o=l =1 o=l 


The case m = 1, r = k is Weyl’s inequality. Actually, (13) also follows easil\ 
from Ostrowski’s discussion of Schur-convex functions. 


5. Applications. Let X = AB, where A and B are arbitrary n-square 
complex matrices. Consider the convex sum 
VY = oX*X + 6XX*, 


where ¢o+6=1, o>0, 6>0. Let @ > 1, a; Dans, Bi > Biri 
(i= 1,...,m—1) be respectively the non-negative square roots of the 
characteristic roots of Y and the singular values of A and B. In the Theorem 4 
we shall use the following concavity property of the elementary symmetric 
functions :* 


If (a) and (b) are sets of n non-negative numbers and 1 < r < n, then 
Ey" (a + 6) > Ey’"(a) + Ey’"(6). 
THEOREM 4. ForO0Q <s<landl<r<gk<n, 


E(¢s’,- +» Goat) 
> (2eratyr-»( l On 941 611) 
d= 


Proof. Let x:,...,%, be an o.n. set of characteristic vectors of Y corre- 
sponding respectively to 


2 


x 
~ 25 25 é 2 2 
E+(Bn'; sey Bn—e+1) E;(aq'; sees On e41): 


2 


Qny+++5 bn—e+1- 
Then 


’This property follows from a concavity property proved by W. Fenchel (2). This inequality 
also foliows directly as a corollary of a similar property for symmetric functions proved by 
Lopes and Marcus in (4). 


| 


oa ~~ 











Th 





— ee 


i 


, 
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ae" (Gn's + +++ Orenes) = Ey!"[(¥xs, x5)*] 

= Ey!"[fo(X°X xy xj) + (XX"x,, x))}"] 

> Epl"[2"fo"(X°X xy, x4)" + 8(XX" xy, x,)"}] 

> QE [o"(X°X xy, x5)"] + Ep!"[8"(XX"x, x,)"]} 
2°" fo" Ey!"[(X"X xy x4)" + 8° Ey"[(XX"* x,, x5)'}}. 


It follows from the Hadamard determinant theorem that 


r 


E,{(X"X x,, x;)"] = } (TT x*x x1. 24)) 
1S ti<...< ire 


t=1 


> D& (CAX"X) x, x)’. 
weQer 
Thus 
sl/r 2s 2s s—1 J 7* a . 
ge on_rs1) > 2” o yz (C,(X°X) ay Xe)" | 


, io J ry* a ' 
+2°"3 p> (C(XX") xu, Xu)" f 
)isr 
= 2°) D (C,(A*A) C,(B) te, CB) x)*t 


l/r 


+2" 3") Do (C,(BB") C,(A*) x, C,(A") x)"t 
r 2/r \ 
>? o Th at-y:) 1 (C.(B*B) sey X0)"f 
j= 
: , r : 2/r - | 1 
+2 : (TI ps1) ‘ } (C,(A A ) Xa Xu) ( 
Jor s—1 II 8 nas Ei” 2s 2s { 
> o) o aS SoH! “rT (Bn peer » Br—e+1) ( 
j s—1 os—1 : 8 ia l/r; 28 2s ( 
+ 6)2 6 TT Bess E, Om p++ + yn r+1) (- 
j= 
The last inequality follows by Theorem 1. The result follows by a classical 
inequality and by taking rth powers of both sides. 
We remark that when g is zero, o” is to be taken as 1. 


Theorem 4 may be used to relate the characteristic polynomials of the 
matrices involved. For example, 


l/r 


/r 


Ip-((ABB"A*)*)| > 2° YT] an ss |b -((B"B)')|, 
where 
x"+ > p,(M) x” 
r=1 


is the characteristic polynomial of the matrix M. 
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In (9) B. Sz. Nagy proves that if A and B are complex n-square matrices, 
then 
(14) On Bn < Au] < a Bi, 


where a; > a@is1, 8; > Bi41 are the singular values of A and B, and X,, 
Ay] > |Avuil, are the characteristic roots of AB. By Theorem 3, 


(15) = * Ay ) < E,(a; Bi, ..., ax Be), 


for l <r <k <n. By setting r = k = 1, we obtain the upper inequality of 
(14). If a, 8, = 0, the lower inequality is trivial. Otherwise A and B are non- 
singular. The characteristic roots of (AB)-' are the A,“', and the singular 
values of A~' and B~ are the a;“' and 8;~'. Hence 


(16) E,(|An| : ae | ln k+1) ') < E,(a,' Bz", “** »@ieenes Bn a Y 


Again setting r = k = 1, we obtain the lower inequality of (14). 

Both (15) and (16) have immediate generalizations in two directions: 
first, to a product of more than two matrices, and second, to the more general 
class of Schur-convex functions. 
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